ALIGNMENTS



RESULT 1 
ABB81074 

ID ABB81074 standard; protein; 1163 AA. 
XX 

AC ABB81074; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant;

KW nootropic; antiparkinsonian; ophthalmological; analgesic; hepatotropic;

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; receptor. 



XX 

OS Rattus norvegicus. 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-00893348.
XX 

PR 19-MAY-1998; 98IL-00124500.

PR 21-JUL-1998; 98WO-US014715.

PR 22-DEC-1998; 98US-00218277.

PR 19-MAY-1999; 99US-00314161.
XX 

PA (YEDA ) YEDA RES & DEV CO LTD.
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides.

XX 

PS Example 5; Page 44-47; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS). The method involves administering

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates), Charcot-Marie-Tooth disease, ataxia

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's

CC disease, or lipoproteinemia. The present sequence represents the rat

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 



CC antigen 
XX 

SQ Sequence 1163 AA; 



Query Match 100.0%; Score 5848; DB 5; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 3.9e-297; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 

        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 



RESULT 2
AAY71310

ID AAY71310 standard; protein; 1163 AA.
XX

AC AAY71310;
XX

DT 02-NOV-2000 (first entry)
XX

DE Rat neurite growth inhibitor Nogo A.
XX

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS;

KW central nervous system; neoplastic disease; antiproliferative; glioma;

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma;

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease;

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment;

KW structural plasticity; screening.

XX

OS Rattus sp.
XX

FH Key Location/Qualifiers

FT Inhibitory-site 1..171

FT /note= "Inhibits NIH 3T3 fibroblast spreading"

FT Modified-site 30

FT /note= "Casein kinase II site"

FT Region 31..58

FT /note= "Acidic region"

FT Region 31..57

FT /note= "Region specifically described in claim 16"

FT Region 172..259

FT /note= "This region is not essential for inhibitory

FT activity"

FT Modified-site 233

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 242..244

FT /note= "Asn is N-glycosylated"

FT Modified-site 291

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 295

FT /note= "Protein kinase C (PKC) site"

FT Misc-difference 404

FT /note= "Encoded by TTG"

FT Modified-site 436

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 468..470

FT /note= "Asn is N-glycosylated"

FT Modified-site 484

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 488

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 502

FT /note= "Casein kinase II site"

FT Inhibitory-site 542..722

FT Modified-site 576

FT /note= "Casein kinase II site"

FT Peptide 623..640

FT /note= "used as immunogen to generate antibody AS 472"

FT Modified-site 626

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 694..696

FT /note= "Asn is N-glycosylated"

FT Modified-site 715

FT /note= "Casein kinase II site"

FT Peptide 762..1163

FT /note= "used as immunogen to generate antibody AS Bruna"

FT Modified-site 784

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 821

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 850

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 855

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 863

FT /note= "Casein kinase II site"

FT Modified-site 868

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 893

FT /note= "Protein kinase C (PKC) site"

FT Modified-site 912..914

FT /note= "Asn is N-glycosylated"

FT Modified-site 925..927

FT /note= "Asn is N-glycosylated"

FT Modified-site 954

FT /note= "PKC and casein kinase II sites" 

FT Modified-site 956 

FT /note= "PKC and casein kinase II sites" 

FT Region 975..1162

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Region 976..1163

FT /note= "C-terminal common region found in Nogo A, B and C 

FT isoforms" 

FT Domain 988..1023

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region specifically 

FT described in claim 16" 

FT Modified-site 1024 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1071..1073

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1073 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1089 

FT /note= "Protein kinase C (PKC) site" 

FT Domain 1090..1125

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region specifically 

FT described in claim 16" 

FT Modified-site 1141..1143

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1143 

FT /note= "Protein kinase C (PKC) site" 

XX 



PN WO200031235-A2.
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX 

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S.
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 

DR N-PSDB; AAD01173. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 3; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a rat Nogo A protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The protein was 

CC derived from a cDNA generated by fusing R018U37-3, R1-3U21 cDNAs isolated 



CC from hexanucleotides-primed rat brain stem/spinal cord library, and Oli18

CC cDNA from an oligo d(T)-primed rat oligodendrocyte library. Nogo proteins

CC and fragments displaying neurite growth inhibitory activity are used in 

CC the treatment of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo

CC activity can be used to treat or prevent hyperproliferative or benign

CC dysproliferative disorders e.g. psoriasis and tissue hypertrophy.

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: The present sequence 

CC designated as SEQ ID NO: 2 is stated to be the same as the sequence shown 

CC in Fig. 13 (see AAY71384) of the specification. However, this sequence 

CC does not match the sequence given in Fig. 13. SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 

Query Match 100.0%; Score 5846; DB 3; Length 1163; 

Best Local Similarity 99.9%; Pred. No. 5e-297; 

Matches 1162; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

       |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420



Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163

        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163

RESULT 3 
AAY71557 

ID AAY71557 standard; protein; 1162 AA. 



XX 

AC AAY71557; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A truncated protein used in the construction of mutant Nogo-A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS;

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX 

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a truncated form of rat Nogo A protein shown in AAY71310, which is 

CC used in the construction of mutant Nogo-A. Nogo-A is composed of His- 

CC tag/T7-tag/vector/Nogo-A sequence aa 1-1162. Nogo A deletion mutants were 



CC used for mapping the inhibitory sites of Nogo protein. Major inhibitory 

CC region was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 
XX 

SQ Sequence 1162 AA; 

Query Match 99.9%; Score 5840; DB 3; Length 1162; 

Best Local Similarity 99.9%; Pred. No. 1e-296;

Matches 1161; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

       |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKA 1162 

        ||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKA 1162 



RESULT 4 
AAY71384 

ID AAY71384 standard; protein; 1163 AA. 
XX 

AC AAY71384; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Alternative version of rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 



XX 

OS Rattus sp.
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 1..171

FT /note= "Inhibits NIH 3T3 fibroblast spreading" 

FT Modified-site 30 

FT /note= "Casein kinase II site" 

FT Region 31..58

FT /note= "Acidic region" 

FT Region 172..259

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Misc-difference 223

FT /label= Unknown 

FT /note= "There is Leu at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 233 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 242..244

FT /note= "Asn is N-glycosylated" 

FT Modified-site 291 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 295 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-difference 404

FT /note= "There is Ile at this position in the sequence

FT shown in AAY71310" 

FT Modified-site 436 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 468..470

FT /note= "Asn is N-glycosylated" 

FT Misc-difference 469

FT /label= Unknown 

FT /note= "There is Lys at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 484 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 488 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 502 

FT /note= "Casein kinase II site" 

FT Inhibitory-site 542..722

FT Modified-site 576 

FT /note= "Casein kinase II site" 

FT Peptide 623..640

FT /note= "used as immunogen to generate antibody AS 472" 

FT Modified-site 626 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-difference 661

FT /note= "There is Asn at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 694..696

FT /note= "Asn is N-glycosylated" 

FT Modified-site 715 

FT /note= "Casein kinase II site" 

FT Peptide 762..1163

FT /note= "used as immunogen to generate antibody AS Bruna"

FT Modified-site 784 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-difference 820

FT /note= "There is Leu at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 821 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 850 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 855 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 863 

FT /note= "Casein kinase II site" 

FT Modified-site 868 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 893 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 912..914

FT /note= "Asn is N-glycosylated"

FT Modified-site 925..927

FT /note= "Asn is N-glycosylated" 

FT Modified-site 954 

FT /note= "PKC and casein kinase II sites" 

FT Modified-site 956 

FT /note= "PKC and casein kinase II sites" 

FT Region 975..1162

FT /note= "This region is not essential for inhibitory

FT activity"

FT Region 976..1163

FT /note= "C-terminal common region found in Nogo A, B and C

FT isoforms"

FT Domain 988..1023

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region" 

FT Modified-site 1024 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1071..1073

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1073 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1089 

FT /note= "Protein kinase C (PKC) site" 

FT Domain 1090..1125

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region" 

FT Modified-site 1141..1143

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1143 

FT /note= "Protein kinase C (PKC) site" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX 



PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 3; Fig 13; 122pp; English. 
XX 

CC The present sequence is an alternative version of rat Nogo A protein 

CC which is a potent neural cell growth inhibitor and is free of all central 

CC nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. Note: The present 

CC sequence is an alternative version of the Nogo A sequence shown in Fig. 

CC 2A (see AAY71310). SEQ ID numbers 35-42 are referred in claim 32 and SEQ

CC ID NO: 29 in disclosure of the specification. However, the specification

CC does not include sequences for these SEQ ID numbers

XX 

SQ Sequence 1163 AA; 

Query Match 99.6%; Score 5823; DB 3; Length 1163; 
Best Local Similarity 99.7%; Pred. No. 7.9e-296; 

Matches 1159; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180



Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
       |||||||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASXPSLSPLSTVSFKEHGYL 240



Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300


Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360


Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420


Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
       |||||||||||||||||||||||||||||||||||||||||||||||| |||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENXTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600


Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 MAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780


Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLSSSKEDKIKESETFSDSSPIE 840

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960


Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020


Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080



Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
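
The summary figures printed for this alignment (Matches 1159; Conservative 0; Mismatches 4; Indels 0; Best Local Similarity 99.7%) can be cross-checked with a short calculation. The sketch below assumes that Best Local Similarity is the number of exact matches divided by the total number of aligned columns. That formula is an assumption which happens to reproduce the printed value; it is not taken from the search program's documentation, and the function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of aligned columns that are exact matches.

    Assumption: the aligned length is the sum of the four column
    counts printed in the result summary.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts taken from the summary lines of the alignment above.
print(round(best_local_similarity(1159, 0, 4, 0), 1))
```

Under the same assumption, the counts reported for the other results in this listing can be passed to the function to check their printed percentages.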



RESULT 5 
AAY71560 

ID AAY71560 standard; protein; 974 AA. 
XX 

AC AAY71560; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiAext. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease;

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX 

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 



CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases.

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a fragment of rat Nogo A protein shown in AAY71310, which is used in 

CC the construction of mutant NiAext. The mutant is composed of His-tag/T7-

CC tag/vector/Nogo-A sequence aa 1-974/T7-tag. Nogo A deletion mutants were

CC used for mapping the inhibitory sites of Nogo protein. Major inhibitory 

CC region was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 
XX 

SQ Sequence 974 AA; 

Query Match 84.1%; Score 4921; DB 3; Length 974; 

Best Local Similarity 99.9%; Pred. No. 8.8e-249; 

Matches 973; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180


Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300


Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360


Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
       |||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540


Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720


Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900


Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy 961 RSLSAVLSAELSKT 974
       ||||||||||||||
Db 961 RSLSAVLSAELSKT 974





RESULT 6 
AAU04591 

ID AAU04591 standard; protein; 1192 AA. 
XX 

AC AAU04591; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein;

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis;

KW Marchiafava-Bignami disease; pontine myelinolysis; adrenoleukodystrophy;

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 



KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe's disease. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Domain 1054..1119

FT /label= Lumenal_extracellular_domain

FT /note= "This sequence is specifically claimed" 

FT Peptide 1055..1094

FT /label= Pepl 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1064..1088

FT /label= Pep2

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1074..1098

FT /label= Pep3 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1084..1108

FT /label= Pep4 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1095..1119

FT /label= Pep5 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

XX 



PN WO200151520-A2. 
XX 

PD 19-JUL-2001. 
XX 

PF 12-JAN-2001; 2001WO-US001041.
XX 

PR 12-JAN-2000; 2000US-0175707P . 

PR 26-MAY-2000; 2000US-0207366P . 

PR 29-SEP-2000; 2000US-0236378P . 
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR N-PSDB; AAS09453. 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders. 
XX 

PS Example 1; Page 101-104; 109pp; English. 
XX 

CC The sequence is the human Nogo protein, a 250kDa myelin-associated axon 

CC growth inhibitor. The invention relates to the use of the nogo receptor, 

CC nogo protein, their nucleic acids, vectors expressing them and antibodies 

CC against them, to isolate agents which block nogo receptor mediated axonal 



CC growth. The agent is useful for treating a central nervous system 

CC disorder which is a result of cranial or cerebral trauma, spinal cord 

CC injury, stroke or a demyelinating disease selected from multiple 

CC sclerosis, monophasis demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, Marchiafava-Bignami disease,

CC pontine myelinolysis, adrenoleukodystrophy, Pelizaeus-Merzbacher disease,

CC Spongy degeneration, Alexander's disease, Canavan's disease,

CC metachromatic leukodystrophy, viral infection and Krabbe's disease

XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 4; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1.3e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I I 1 M I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I M I I I I : I I : I M I I I I I I I I I I I I I I I ! I I I I : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml I I I I I I I I I I I I I I I I I I I II I I I II II I I I I II I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I II I I I I I I I I M I I I I I I I I I I I II I : I : I I I I I I I I I II I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

M I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I :: II I I I I I II I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

M I I M I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I I I Ml 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

I I I I I I I I I ::: I I I : I I I I I II I I I I : I I II I : I I I : M I : I : 

Db 358 EWS S EKAKDS FNEKRVAVEAPMREE YADFKP FERWEVKD S KEDS - DMLAAGGKI ESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I I I I I : I I Mill: II II Ml I I II M I Ml I I II I II I Mill 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513

I I I I I I I II I I I II I II II M I II : I I I II I I I I I I I I I I I I I II I II I M 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I M I II I I I I I I I I I I I II I II I I I I I I I I II I M I I I I I I M I I I II Ml 
Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

I M I II M I I I II I II I II I I II II I M I I I I I I M I I II I I I I M M I I MM 



Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I I I I I I I : 1 : I I : I II Mill: I I I : I I II I I I II I I t I I i I I I I I I I I : I I I
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : I I I : I I I : I I : I : I I II I I I I : I I I I I I I I I I I I I : I I I I : I I I I : I I I I I I I
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : : : : I : I I I I I I I I I I I I i : : I : I I I : : : I I : I I I I I I I I I I

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : I I I I I I III : I : I : I I I I I I I I I I I I I I I I I : I : I II I M : I II I I I I I
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I 1 1 I I I I I I I I I I I I I I I I I I I : : : I I : I ! : I I : I I : I : I
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

III I I : I I I I I I I I I I I M I I I I I I I II I II: I I I I I I I I I I I I I II I I I I
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I 1 I I I I II I
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I I I I I I I I I I I II I I II I I I I I I I M I I I I I I I I I 1 I I I I I I I II I I I 1 I I I I

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I : I II I I I I I I I I I I I I I II I : I I I I I I I I I I I I I I I M I I :
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 7 


ABP68600

ID ABP68600 standard; protein; 1192 AA.
XX

AC ABP68600;
XX

DT 14-JAN-2003 (first entry)
XX

DE Human pancreatic cancer expressed protein SEQ ID NO 71.
XX

KW Human; pancreas; cancer; gene therapy; vaccine; immunostimulant;

KW cytostatic; tumour.
XX

OS Homo sapiens.
XX

PN WO200260317-A2.
XX

PD 08-AUG-2002.



XX 

PF 30-JAN-2002; 2002WO-US002781 . 
XX 

PR 30-JAN-2001; 2001US-0265305P.

PR 31-JAN-2001; 2001US-0265682P . 

PR 09-FEB-2001; 2001US-0267568P . 

PR 21-MAR-2001; 2001US-0278651P . 

PR 28-APR-2001; 2001US-0287112P . 

PR 16-MAY-2001; 2001US-0291631P.

PR 12-JUL-2001; 2001US-0305484P.

PR 20-AUG-2001; 2001US-0313999P . 

PR 27-NOV-2001; 2001US-0333626P . 
XX 

PA (CORI-) CORIXA CORP. 
XX 

PI Benson DR, Kalos MD, Lodes MJ, Persing DH, Hepler WT, Jiang Y; 
XX 

DR WPI; 2002-627435/67. 

DR N-PSDB; ABV94680. 
XX 

PT New isolated polynucleotide and pancreatic tumor polypeptides, useful for

PT diagnosing, preventing and/or treating cancer, particularly pancreatic 

PT cancer. 
XX 

PS Claim 2; SEQ ID NO 71; 300pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 

CC any of a group of over 4000 nucleotide sequences (ABV94628-ABV99145); (b)

CC complements of (a); (c) sequences consisting of at least 20 contiguous

CC residues of (a); (d) sequences that hybridize to (a), under moderately 

CC stringent conditions; (e) sequences having at least 75% or 90% identity 

CC to (a); or (f) degenerate variants of (a). Polypeptides (ABP68596-

CC ABP68637) encoded by (I) and oligonucleotide can be used to detect cancer

CC in a patient and compositions comprising polypeptides, polynucleotides, 

CC antibodies, fusion proteins, T cell populations and antigen presenting 

CC cells expressing the polypeptide are useful in treating pancreatic cancer 

CC and stimulating an immune response. The polynucleotides can be used as 

CC probes or primers for nucleic acid hybridisation, in the design and 

CC preparation of ribozyme molecules for inhibiting expression of the tumour 

CC polypeptides and proteins in the tumour cells, in vaccines and for gene 

CC therapy. Note: The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 5; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1.3e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Mhlll I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: II II 11 I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 P AAGL S AAAVP — P AAAAP LLD F S S D S VP PAP RGP L P AAP P AAP E RQ P S WERS P AA P 115 

MINIM I ! I I I I I I : I I : I I I I I I I I II I I I I I I I I M I I I : I I : I 
Db 59 P AAGL S AAP VP T AP AAGAP LMD FGN DFVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP 118 



116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Ml | 1 I I I II I I I I I II I I I I I I I I I I III I I I I II I I I I I I I I 

119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 17 8 

167 RG-SGSVDETLFALP7yVSEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I M I I I I I I I I I M I I I I Ml I I I I I I : I : I I II I I I I I I I I I I I I I I 
179 RGSSGSVDETLFALPAASEPVIRS SAEN-MDLKEQPGNTI SAGQEDFPSVLLETAASLPS 237 

226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I II I I I I I I I I I : I I I : : I : : M I I I : I : I : : I I I I I M I I I 
238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

28 6 SEMGS S FKGS PKGESAI LVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG KED 339 

I I I M I I I I , I I , : : I I : i I : I : : I ! : , : I I : I I : I I I III 

298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

34 0 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Ml II I II I : : : I I I : I I I I I I I I I M : INN: I I hill : I : 

358 EWS S EKAKD S FNEKRVAVEAPMREE YADFKP FERVWEVKD S KED S - DMLAAGGKI ESNL 416 

396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

II I || : I I Mill: I I I I I : I I I I I I I I : II I I I II I I I : I I I I 

417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 47 6 

455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I I I I I I II : I I 1 I : I I I II I I I I I I I I I I I I I MINI I : 
477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

514 KVTEAAVSNMP EGLT P DLVQEACES ELNEAT GTKI AYETKVDLVQT S EAI QES LYPTAQL 573 

I I I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I 1 I I : I I I I I M : I I I I I I III 
Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633

I | I || I : I I II I I I I I I I I I I I I M I : I I I I I I I : M I I I I I I I : I : I I I I II I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

M | | | I I I : I : I I : I II INN: II I : I I M I II I I II I M I II M I I I I : M I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

| : | || : I I I : I I : I : II II I II I : II II II I I II I II : I I I Ml I I M I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : : : : I : II I I I I I II I II I : : I MM : : : I M II I I I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

M M II I II I III : I : M I I I II II II II I I II I I MM M I I I : II I I M M 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

II II II II I I I I II I II I I M II : : : II M I : I I : I 1 : I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955





Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986

Ml | | : | | | I I I I II I I I I II I I I I M I I II: II I I I I I I I I I I I I I II I I
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015

Qy 987

I I I I I I I I I I I I I I II I M I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I

Db 1016

Qy 1047 YLESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I

Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107

I | | I I I I I I I I I I I I : I I II I I I I I I I I I I I I I I I : I I M I I I I I I I I I I I I M

Db 1136



RESULT 8 
ABR59667 

ID ABR59667 standard; protein; 1192 AA. 
XX 

AC ABR59667; 
XX 

DT 22-JUL-2003 (first entry) 
XX 

DE Human NogoA protein. 
XX 

KW Human; Nogo receptor; NgR; CTS domain; neuroprotective; gene therapy; 

KW axonal growth; central nervous system; CNS; Nogo; spinal cord injury; 

KW cranial trauma; cerebral trauma; spinal trauma; stroke; Krabbe's disease;

KW demyelinating disease; multiple sclerosis; monophasic demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis. 

XX 

OS Homo sapiens. 
XX 

PN WO2003031462-A2. 
XX 

PD 17-APR-2003. 
XX 

PF 04-OCT-2002; 2002WO-US032007.
XX 

PR 06-OCT-2001; 2001US-00972599 . 
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2003-393433/37. 
DR N-PSDB; ACC81048. 
XX 

PT New human Nogo receptor polypeptides and nucleic acids, useful for 
PT decreasing inhibition of axonal growth by a central nervous system 
PT neuron, or in treating central nervous system disease, disorder or 
PT injury, e.g. spinal cord injury. 
XX 

PS Disclosure; Page 131-135; 148pp; English. 
XX 



CC The invention relates to a novel nucleic acid encoding a polypeptide 

CC comprising amino acid residues 27-309 of a 473 amino acid sequence (PI, 

CC human Nogo receptor (NgR) NTLRRCT domain), or residues 27-309 of PI with 

CC 1-20 conservative amino acid substitutions, and less than a complete CTS 

CC domain, provided that a partial CTS domain, if present, consists of no 

CC more than the first 39 consecutive residues. The nucleic acid of the 

CC invention has neuroprotective activity. The polynucleotide may have a use 

CC in gene therapy. The nucleic acid is useful for decreasing inhibition of 

CC axonal growth by a central nervous system (CNS) neuron. The NgR 

CC polypeptide or an agent that inhibits the binding of Nogo to NgR or NgR-

CC dependent signal transduction in the central nervous system neuron may be 

CC used in treating central nervous system disease, disorder or injury, e.g. 

CC spinal cord injury. Expression of an NgR protein may be associated with 

CC inhibition of axonal regeneration following cranial, cerebral or spinal 

CC trauma, stroke or a demyelinating disease, such as multiple sclerosis,

CC monophasic demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, or Krabbe's disease. The present

CC sequence is used in the exemplification of the invention 

XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 6; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1.3e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 
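
The summary figures above can be cross-checked with a little arithmetic. The sketch below assumes that "Best Local Similarity" is the number of identical matches divided by the total number of alignment columns (matches + conservative + mismatches + indels). That reading is inferred from the reported numbers, not stated in the report itself, and the function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all alignment columns.

    Assumption: every column of the alignment is counted exactly once
    as a match, a conservative substitution, a mismatch, or an indel.
    """
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Figures reported for RESULT 8: 909 / 1197 columns
print(round(best_local_similarity(909, 104, 145, 39), 1))  # 75.9
```

Under this assumption the RESULT 8 counts reproduce the reported 75.9%, which suggests the four counts partition the 1197 alignment columns.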

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I MINI I I I I I I I I I II I I I I ! I I I : I I :: I ! I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I 1 I i : I I : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I M M I M I I I I I I I II I I II III I I I I I I : I : I I I II I II I II I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I I II I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I M I I I I I I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : M : I : I I : I I : I I I I I I 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Mill I M I : : : I I I : M I I II I II I I : I I I I I : I I I : I I I : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I I I II : I I Mill: II I I I : I II II II I M I I I II I I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 



Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

| | | | | 1 | 1 M | I I I I I II I : I I I I : I I I II I I M I I I I I I I I I I I I I I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I 1 I I I I I I M I I I 1 I I M I I I I I I : I M I II I : I I II 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I | | | | | : || I I I I I I I I I I I I II I I I : I I I I I I I : I II I I I I I I : I : I I I MM 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

| | | | | M | : | : | | : I II Mill: I II : M I I I II I I I II I II I II II I I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

| : | | | : | | | : I I : I : II I I M I I : I II II II I I I I I I : I I I Ml I M : I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : :: : I : I I I I I I I I I II I M : I MM : :: I M Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: | | : || | || I Ml M M M II II I I II I I I II II I MM II II I M I I II I I I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I M I I I I II I I I I II II I I I : : : I I M I M I : I I : I M 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

Ml | | : | | I I I I I II I I I I I I I II I I I II II: I I I I I M I I M II I I II II 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

| | | | | | | I I I || I II II I I I II I M II M I II I II I I I I I I II II II 11 II I I I I II I I I 
Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I | M I II I II I I II I I II I I II I II I I M I II I I II I I I II II I I II I I II II I I M I I 
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I || | | I I I II I II I I M II I II I I I I I I M I M M M II I II II II I I I II II II : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 9 
AAY56967 

ID AAY56967 standard; protein; 1192 AA. 
XX 

AC AAY56967; 
XX 

DT 25-APR-2000 (first entry) 
XX 



DE Human MAGI polypeptide. 
XX 

KW MAGI protein; neuroendocrine-specific protein; neuropathy; human;

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective; neuroprotective.

XX 

OS Homo sapiens.
XX 

PN WO200005364-A1. 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB002360 . 
XX 

PR 22-JUL-1998; 98GB-00016024.

PR 19-JUL-1999; 99GB-00016898 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR N-PSDB; AAZ56886. 
XX 

PT Novel polypeptides related to neuroendocrine-specific proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders. 
XX 

PS Claim 2; Page 20-21; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specific protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleotide is also useful for chromosome

CC localization and for tissue expression studies. The present sequence 

CC represents the human MAGI protein 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 3; Length 1192; 
Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 IMM MINI I I I I I I I I I I I I I I I I M I : I I : : I M I I M I I I M I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

| | I I I I I I I I I I I I M : I I : I I I I I I II I I I I I I I I I I M I M : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml I I I I I M i I I I I M M II II II II I i N I I M I I I I I I I I I 



Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

M | I I I I I I I 1 I M M I I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill I I I I I I I :lil::l ::IIMI: Ml :: Mill 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I M M M III IIMM I MMIMMMM II : IM II I Ml 
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395

Mill I M I : : M I M II I I I I M M M I II I M I I I : I M : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

I M I I : II Mill: M I I Ml M M M I Ml II II II I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

| M II I I II I II II I II I I : II I I : I I I M I II I I M I I I I I I I I I I M I: 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

II II I : II II II M I II M II II II M I II I II I I M M II I II I M I M I I III 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I || | | | : I I I I II II I I II II II II I : I I I II I I : I II Mill M I M II MM 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

M M II I I : I M I : I II I I I M : II I M II II II M II I II II I M II I M II I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

| : | | | : || | : I I : I M I II II I I : I I II I I I II I I I I : I I I Ml M M M M I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

| : : : : : : I : I I I I I I I I I I II I : : I MM : : : I M I I I M Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

M I : M II I I III M M M II I II II I II II I I I I MM II II I M I I II M I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I M II II I I I II II I I I I I I 11 = 11 = 1 M II = I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

I I I | I : MM I I M M II II II I I II I II I I : I II II I II I I I M I I I II I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 



Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
1 | | | I I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I
Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHWSTIKELRRL^ 1106

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I M M I I I I II I I

Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGAL 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I II I I I I : I I I I I I I I I I I I I M I I I I : I I I I I I I I I I I I I II I I I I :
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGIANKWKDAMAKIQAKIPGLKRKAE 1192



RESULT 10 
AAB82349 

ID AAB82349 standard; protein; 1192 AA. 
XX 

AC AAB82349; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A protein. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; brain injury; 

KW stroke; neuronal degeneration; Alzheimer's disease; Parkinson's disease; 

KW neuromuscular disorder; psychiatric disorder; developmental disorder; 

KW neuroprotective; nootropic; neuroleptic; antiparkinsonian; 

KW cerebroprotective; neuroleptic; diagnosis; therapy. 
XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2000WO-GB004345 . 
XX 

PR 15-NOV-1999; 99GB-00026995 . 

PR 24-JAN-2000; 2000GB-00001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90324. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases. 
XX 

PS Disclosure; Page 26-27; 25pp; English. 
XX 

CC The present sequence is that of human NOGO-A. NOGO-A is a previously 

CC known splice variant of the human NOGO gene on chromosome 2p21. The 

CC invention relates to a novel splice variant, NOGO-C (see AAB82348). It



CC provides NOGO-C polypeptides and polynucleotides, and methods for 

CC producing such polypeptides by recombinant techniques. Also disclosed are 

CC methods for utilising NOGO-C polypeptides and polynucleotides in the 

CC treatment of diseases including neuropathies, spinal injury, brain 

CC injury, stroke, neuronal degeneration, for example Alzheimer's disease

CC and Parkinson's disease, neuromuscular disorders, psychiatric disorders 

CC and developmental disorders. Also provided are methods for identifying 

CC agonists and antagonists for use in treating conditions associated with NOGO

CC -C imbalance, and diagnostic assays for detecting diseases associated 

CC with inappropriate NOGO-C activity or levels 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 4; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

| | | : M I I I I I I I 1 I I I I I I I I I I I I II I I I I II I I I : II - I M I I I I I I I I II 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

| | | | | | || || III I I I : I I : I I I I I I I I I I I I I II I I I II I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml I I I I I I I I I I I I II I II I I I I I II II II I I I I I I I I I I I II 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

|| | I I I I || I I I II I I I I I I I I I I I I I I I : I : I I I I I I II I I I I I I I I II 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I I I I I Mill MINI I Mll::| ::lllll: Ml ::IM lllllll 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

Mill I I I : i I I : I M I I = : I M M II : IM II I HI 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395

| M II I IN : : : I I M I i II I II I II M I I I I I : I I I I I I : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

|| II I : I I Mill: I M I Ml lllllll : I I lllllll : I M I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

| || | | | | I I I I I II I II I I : II I I : I II I I M I M II II I I II I I II I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM I : I II I I I I II I I I I I I I I M I I M I M M II M II I II I I : I I I I I i M I 
Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596



Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

| M 11 I : I I I M I I I I I I I I I I I I I I : I I I I I I I = I I I I I I I I I : I : I I I I 1 I I 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I M M I t : I : I I : I I ■ MhlMIMI MINI Ihlll 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

| : M I : I I I : I I : I : I I I I I I I I : I I II I I I II I I I I : I I I I : I M I : I I I I I I I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

| : : : : : : | : I I | I I I I I I II I I : : I : I I I : : : I I : I I I I I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: | | : | | | M | IN : I : I : I I I I I I I I I I I I I I I I I : I : I II I I I : I I I I I I I I 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I I I II I I I I I I I I I I I I I I I : I I : I I = I I : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

Ml | |: | | | | I I I I I I I I I I II I I I I II I II: I I I 1 I I I I I I I I I I I I I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

II | M I I I I I I I I I I I I I II I I 1 I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II 

Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

| | | | | | M I II I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

| | | | | | | I I I I I I II : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I M I I I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 11 
ABG30938 

ID ABG30938 standard; protein; 1192 AA. 
XX 

AC ABG30938; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE Human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

KW neuroblastoma; hyperproliferative disorder; dysproliferative disorder;

KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 

KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 

KW Nogo-associated disease; metastasis. 



OS Homo sapiens. 
XX 

PN WO200257483-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB000228.
XX 

PR 18-JAN-2001; 2001GB-00001312 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 

DR N-PSDB; ABK90134. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysproliferative disorders, comprises

PT providing and monitoring interaction between Nogo and BACE polypeptides. 

XX 

PS Disclosure; Page 59-62; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperproliferative or dysproliferative disorders (e.g.

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present amino acid sequence represents the human NogoA 

CC protein of the invention 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 5; Length 1192; 
Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20;



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

|||:||| I I I I I I I I I I I I I I I I M I I I I I I I I I I I I : I I I I I I 1 M I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I I I I I : i I : I I I I I I I I I I I I I I I 1 I I I I I II : M : I
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 166

I I I II I I I I I I I I I I II I II I I I I I I I MM I I I I II M I M II

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I II II I II II II MM III I I I I I M I : I I I I I I I I I I I I I I I I I I
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

Mill I I II I I I I II I I : I M : : I - = 11111= Ml -Ml I I I I I I I

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I M I I M II I I I I : : I I M I : II : M M M I I : I I : M I III
Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395

Mill I I I I :: M I M II II I M I II M I I II M I I M I II : I :

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396

II II I : I I Mill: II I I I : I II I I II I : I I II I I I I I Mill

Db 417

Qy 455 513

I I II I I M II I II II I II I M II I : II I II I II I II I II I I I I MINI I

Db 477

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573

I II I M I I I II II II I II I I I M I M I M II II II II : I I II II I M II II I Ml
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633

II II I M II I I II I II M II I I I I I I : I II I II I M I I I I I II I : I M I I II I I
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

II II I I I I : M I I : I II MUM II I M I I I II I II I I I II I I II I II 11 = 111
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : | I I: II M I I : I : I II II I I I : I I II M I I I I I I I : I I I Ml II I : II I I II I
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

I : : : : : M M II I I I I I II II I : M MM : : : I I : I II I I I I I I I

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

M M I I M II III M M M I II I I I I I I I II I I I I MM II II I M I I I II I I

Db 836


Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

| | I I I I I I I I I I I I I I I I I I I I I I I : I I : I I : I I : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986

Ml | I : I I I I I I I I I I I I I I I I I I I I I I I II: I I M I I I I I I I II I I I I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I 
Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II 1 II I I I I I I I I 
Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I II I I I I I I I I I I I I : II I II I II I I I I I I I I I I I : I II I II I I M I I I I I M I I : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192

RESULT 12 
ABB81078 

ID ABB81078 standard; protein; 1192 AA. 
XX 

AC ABB81078; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant;

KW nootropic; antiparkinsonian; ophthalmological; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy;

KW neurotransmitter receptor; human; receptor. 
XX 

OS Homo sapiens. 
XX 



PN US2002072493-A1.
XX

PD 13-JUN-2002.
XX

PF 28-JUN-2001; 2001US-00893348.
XX

PR 19-MAY-1998; 98IL-00124500.

PR 21-JUL-1998; 98WO-US014715.

PR 22-DEC-1998; 98US-00218277.

PR 19-MAY-1999; 99US-00314161.



XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 



DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides.

XX 

PS Example; Page 53-56; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS). The method involves administering

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates), Charcot-Marie-Tooth disease, ataxia

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies,

CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's

CC disease, or lipoproteinemia. The present sequence represents the human

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 5; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 2.3e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

i i i * i i I i i i i i i i i i i i i i i i i i i •*''**'*'*'
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II I I I I I I I I 1 I I I I I I I I II II I I III I I I I I I : I : I I I I I I I I I I I I I I I I I I 
179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 2 85 

I I I I I I I I I I II I I I I I : I I I : : I : : I I I I I : I : I : : I I I I I I I I I I 

238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

286 SEMGS S FKGS PKGES AI LVENTKEEVI VRSKDKED-LVCSTUVLHS PQES PVG KED 339 

I I I I I I I I II I M : : I I : I I : II : : I I : I : I I : I I : I I I III 

298 S EMGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKED 357 

340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I i I : I I I : I : 

358 EWS S EKAKDS FNEKRVAVEAPMREE YAD FKP FERWEVKDS KED S - DMLAAGGKI ESNL 416 

396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

11111:11 Mill: I I I I I : I I I I I I I I : I I I I I I I I I : I I I I 
417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 47 6 

455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I I I I I I I I I I I I I I I I I I I I : 
477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

II I I I : I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I 1 I I I III 

537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I I I : I I I I I I I I I I II I I I I II I : I I I I I I I : I I I I I I I I I : I : I I I I I I I 
597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I II I I I : I : I I : I II Mill: I I I : M I I I I II I I II II II II I II M : M I 
656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I II : M I : II : I M I II II I I : II II M II I I I II : II I Ml I I M I II I I I I 
716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

7 53 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 8 09 

I : : : : : =1 = 111 I I I I II M I I : : I MM : : : II M I II I II II I 

77 6 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 8 68 

: II M I I II I III : I M : II II M I II II II II M MM II II I M I M II II 
836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNV 926 

I I II II II II II I II II I M I II : : : I I M I M I : I I : I M 

8 96 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

I I I I I : II II I I II I I I I I II I I II II I I II: II I II I I I I II I I I I I I M 
956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVT1SFRIYKGVIQAIQKSDEGHPFRA 104 6 



Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I | | I I I I I I I I I I I I I I I I I I I II I I II I I I II 1 I I M I I I I I I I I I I I I I I I I I I I I I 
Db 1076 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLI LALI SLFSI PVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

I I I I I I || I I I I I I I : I M I I I I I I I I II I I I I I I : I II I II I I I I I I I I I I I I I : 
Db 1136 NGLTLLI LALI SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 13
AAY71311

ID AAY71311 standard; protein; 1178 AA.
XX

AC AAY71311;
XX

DT 02-NOV-2000 (first entry)
XX

DE Human neurite growth inhibitor Nogo.
XX

KW Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS;

KW central nervous system; neoplastic disease; antiproliferative; glioma;

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma;

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease;

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment;

KW structural plasticity; screening.
XX

OS Homo sapiens.
XX

FH Key Location/Qualifiers

FT Misc-difference 187

FT /label= Unknown

FT Misc-difference 188

FT /label= Unknown

FT Misc-difference 189

FT /label= Unknown

FT Misc-difference 190

FT /label= Unknown

FT Misc-difference 221

FT /label= Unknown

FT Misc-difference 328

FT /label= Unknown

FT Misc-difference 477

FT /label= Unknown

FT Region 977..1012

FT /note= "Region specifically described in claim 16"

FT Region 994..1174

FT /note= "Region specifically described in claim 16"

FT Region 1079..1114

FT /note= "Region specifically described in claim 16"
XX

PN WO200031235-A2.
XX

PD 02-JUN-2000.
XX

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The human Nogo 

CC sequence was derived by aligning human expressed sequence tags (ESTs) 

CC e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, AA081525 

CC and AA081840 with the rat Nogo sequence. Nogo proteins and fragments 

CC displaying neurite growth inhibitory activity are used in the treatment 

CC of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperproliferative or benign

CC dysproliferative disorders e.g. psoriasis and tissue hypertrophy.

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1178 AA; 



Query Match 73.1%; Score 4276.5; DB 3; Length 1178; 

Best Local Similarity 73.8%; Pred. No. 5.5e-215; 

Matches 883; Conservative 104; Mismatches 158; Indels 51; Gaps 19 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

M I : I I I I I I M II III I I I I I I II I I I II I I I I I I : I I :: I I I I I I I I M I I I 
Db 1 MEDLDQSPLVSSS-DSVPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I I I I I I I I I I I I I I I I : I I : I M I I I I I I M I I I I I I I I I I I I : I I : I 
Db 59 P AAGL S AAP VP T AP AAGAP LMD F GN D FVP PAP RG P L PAAP P VAP E RQ P S W D P S P VS S T VP 118 



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II
Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Ml : I I I I I I M I I I : I : I I I I I I I I I I I I I I I I I I 

Db 179 RGSSG AVVXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

I I I I I I I I I I I I II I I : M I : : I : : II I I I : I : I :: I I I I I I I I I I I 

Db 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

Qy 287 EMGS S FKGS PKGESAI LVENTKEEVI VRS KDKED- LVCSAALHS PQES PVG KEDR 340 

I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : II III III 
Db 285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

Qy 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Mill I I I I : : : I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : I : I 

Db 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMIiAAGGKIESNLE 403 

Qy 397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

||||:|| Mill: I II I I : I II I I II I : II I M II II I Mill 
Db 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Qy 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

I I I I I M I I II I I I II I II : M II Ml I II I I I I I I II III I I M II I M 
Db 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVT^QDSCTDYVTTDNLTK 523 

Qy 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

I I I I : I I M I M I I I I I I I I I I I I I I M I I I I I II I M I I I I M M I I I I I I I I I 
Db 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

Qy 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

I I I I I : I I I I I I I I I I I I I I I I I I I M I I I II I M I I I I I I I I M M II I I I I I 
Db 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

Qy 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

I I M I II M M I I II Mill: I I I M I II I I I I I I I I II I I I I I I I I I M I I I : 
Db 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

Qy 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

I I I M I I : M M M I II I I II M I I I I I M I I I I I M I I Ml I I I M I I I II I I 
Db 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

Qy 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

: : : : : M : I I I I I II II II I I : M MM : : : II M I I I I I I II I M 

Db 7 63 SMIEYENKEKLSALPPEGGKPYLESFKLSLVNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

Qy 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

I M I I II I III M M M II I I II I I I I I I I I I I MM II I II : M I I I I I I I 
Db 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

Qy 871 SEIANIQSGADSLPCLELPCDLSFKNI YPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Mill I I I II I I II I II I II I I : : : I I M I M I : II : I M I I 
Db 883 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

Qy 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKK 987

I I I : I I II I I I I II I II I I II II I I I I II: MM I I II II I I II I I I M
Db 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKK 1002

Qy 988 TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 M 1 1 1 1 1 1 1 i 1 1 1 I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 1003 TGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062

Qy 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 1063 LESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122

Qy 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

1 II 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 : M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 :
Db 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGIANKNVKDAMAKIQAKIPGLKRKAE 1178

RESULT 14
AAU33228

ID AAU33228 standard; protein; 1246 AA.
XX

AC AAU33228;
XX

DT 18-DEC-2001 (first entry)
XX

DE Novel human secreted protein #3719.
XX

KW Human; vaccination; gene therapy; nutritional supplement;

KW stem cell proliferation; haematopoiesis; nerve tissue regeneration;

KW immune suppression; immune stimulation; anti-inflammatory; leukaemia.
XX

OS Homo sapiens.
XX

PN WO200179449-A2.
XX

PD 25-OCT-2001.
XX

PF 16-APR-2001; 2001WO-US008656.
XX

PR 18-APR-2000; 2000US-00552929.

PR 26-JAN-2001; 2001US-00770160.
XX

PA (HYSE-) HYSEQ INC.
XX

PI Tang YT, Liu C, Drmanac RT;
XX

DR WPI; 2001-611725/70.
XX

PT Nucleic acids encoding a range of human polypeptides, useful in genetic

PT vaccination, testing and therapy.
XX

PS Claim 20; Page 737; 765pp; English.
XX

CC The invention relates to novel human secreted polypeptides. The

CC polypeptides and antibodies to the polypeptides are useful for

CC determining the presence of or predisposition to a disease associated

CC with altered levels of polypeptide. The polypeptides are also useful for

CC identifying agents (agonists and antagonists) that bind to them. Cells

CC expressing the proteins are useful for identifying a therapeutic agent



CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising the 

CC nucleic acids encoding the polypeptides and cells genetically engineered 

CC to express them are also useful for producing the proteins. The proteins 

CC are useful in genetic vaccination, testing and therapy, and can be used 

CC as nutritional supplements. They may be used to increase stem cell 

CC proliferation; to regulate haematopoiesis; and in bone, cartilage, tendon

CC and/or nerve tissue growth or regeneration; immune suppression and/or

CC stimulation; as anti-inflammatory agents; and in treatment of leukaemias.

CC AAU29510-AAU33304 represent the amino acid sequences of novel human

CC secreted proteins of the invention 

XX 

SQ Sequence 1246 AA; 

Query Match 70.4%; Score 4116; DB 4; Length 1246; 

Best Local Similarity 72.0%; Pred. No. 1.4e-206; 

Matches 873; Conservative 112; Mismatches 171; Indels 56; Gaps 27 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : I I : : I I I II I I I II I II 
Db 42 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 99 

Qy 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I I I I I I I I II III I II : I I : I I I I I I I I I I I I I I I I I I I II I I : II : I 

Db 100 P AAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP 159 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 160 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 219 

Qy 167 RG-SGSVDETLFALPAASEPVIPSS7^EKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II I I I I I I I I I I I I I I I I I I I I II I III I I I I II : I : I I I I I I I I I I I I I I I I I I 

Db 220 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 278 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

I II I I I I I I I MINI I : I I I : : I : : I I I I I : I : I : : I I I I II I I I I 

Db 279 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 338 

Qy 286 SEMGS S FKGS PKGESAI LVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG KED 339 

I I I I I I I | | | | | | : : | I : I I : II : : I I : I : I I : I I : I I I III 

Db 339 SEMGS SFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 398 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Ml II I III : : : I II : II I I I I I I I I I : INN: I I Mill = I : 

Db 399 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 457 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

II II I : II I I I I I : I II I Ml I I I I II I : II I I II I M I Mill 

Db 458 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 517 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I II I I I I II I I I I II I II I M I I I : I I I II II II I II II EMI MUM I : 
Db 518 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 577 

Qy 514 KVTE7VAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I II I : II I I II I II II II II I I I I I I I II II I II II : II II II I : I I I I II II I 
Db 578 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 637 



Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I I I I I I : I I I I I I I I I M I I II I I I I : I I I I I I I : I I I I I I I I I : I : I I I lift 
Db 638 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGA3VIQPSSSPLEA-SSVNYESIKHEPEN 696 



Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

I I I I I I I I : I : I I : I II Mill: M I : I I I I I I I I I II I I I I I I I I I I 11 = 111 
Db 697 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 756 



Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I I : I I I : I I : I : I I I I I I I I : I I II I I I I I I I II : I I I hi I I I : I I I I I I I 
Db 7 57 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 816 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I 1 I I I I I I I I I : : I : I I I : : : I I : I I I I I I I I I I 

Db 817 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 876 



Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: I I : I I I I I I III : I : I : I I I I I I I I I I I I I I I I I : I : I II I I I : I I I II I II 
Db 877 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 936 



Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926 

I I II I I I I I I II I I I II I I I I I I : : : I I : I I : I I : I I : I : I 
Db 937 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 996 



Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 98 6 

III I I : II I I I I I I I I I I II I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I 
Db 997 SALATQAEIES I VKPKVLVKEAEKKLPSDTEKEDRSPSAI FSAELSKTSWDLLYWRDI K 1056 

Qy 987 KTGWFGAS-LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFR 1045 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1057 KTGWFGASAVFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFR 1116 

Qy 1046 AY LESEVAISEELVQKYSNSALGHV-NSTIKELRR LFLVDDLVDSLK-FAVLMW 1097 

I I I I : : I : I I I I I I I : I I : I I I I I I M I : I I I I I I 

Db 1117 AISGNLESCLYLRELGSGRYSNSALGSMWNCTVKGNFRAPSFFSWMDLVDSLRSFAVLMW 1176 

Qy 1098 VFTYVGALFNGLTLL IIALISLFSIPVIYERHQVQIDHYLGL^KSVKDAMAKI 1151 

I I I I II I I I : I III : I I I I I I I I I I I I I I I I I : I I I I I I I I 

Db 1177 VFTYVGCL — GLMVLDTTGFWALNFISSSGSWLIYERHQAQIDHYLGLANKNVKDAMAKI 1234 

Qy 1152 QAKIPGLKRKAD 1163 

I I I I I I I I I I I : 
Db 1235 QAKI P GLKRKAE 1246 



RESULT 15 
AAY71562 

ID AAY71562 standard; protein; 803 AA. 
XX 

AC AAY71562; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiG. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 



KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease;

KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysproliferative disorders e.g. psoriasis

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a fragment of rat Nogo A protein shown in AAY71310, which is used in 

CC the construction of mutant NiG. The mutant is composed of His-tag/T7- 

CC tag/Nogo-A sequence aa 172-974/His-tag. Nogo A deletion mutants were used

CC for mapping the inhibitory sites of Nogo protein. Major inhibitory region 

CC was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 



CC numbers 
XX 

SQ Sequence 803 AA; 

Query Match 68.8%; Score 4023; DB 3; Length 803; 

Best Local Similarity 99.9%; Pred. No. 6e-202; 

Matches 802; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 172 VDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 231 

I I I I I I I I I I I M II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I 1 I I I I I I I I I I I 
Db 1 VDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 60 

Qy 232 VS FKEHGYLGNLSAVS S S EGTI EETLNEASKELPERATNPFVNRDLAEFSELEYS EMGS S 2 91 

I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 VS FKEHGYLGNLSAVS S S EGTI EETLNEASKELPERATNPFVNRDLAEFSELEYS EMGS S 120 

Qy 292 FKGS PKGESAI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PVGKEDRVVS PEKTMDI F 351 

II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 FKGS PKGESAI LVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIF 180 

Qy 352 NEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQK 411 

I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I II I I I I I : I I II I I I 
Db 181 NEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQK 240 

Qy 412 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTD 471 

I I I I II I I I I II I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 241 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTD 300 

Qy 472 EKKIEERKAQIITEKTSPKTSNPFLVAVQDSEAJ^YVTTDTLSKVTEAAVSNMPEGLTPDL 531 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 EKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDL 360 

Qy 532 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 591 

I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I II I I II I I I I I I I I I I I I 
Db 361 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 420 

Qy 592 VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 651 

I I I II I I I I I M I I I I I I I I I I M II I I I I I I I I I M M I I I I I I I I I I I I I I I M I I I I 

Db 421 VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 480 

Qy 652 EGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 711 

I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 481 EGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 540 

Qy 712 VEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELG 771 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I II I I I II II 

Db 541 VEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELG 600 

Qy 772 KPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESE 831 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I II I I I I I I II I I I II I I I I I I I I I I 
Db 601 KPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESE 660 

Qy 832 TFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 891 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I II I M I I 
Db 661 TFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 720 

Qy 892 LSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 951 



Db 721 LSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 780 

Qy 952 LPSDTEKEDRSLSAVLSAELSKT 974 

I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 LPSDTEKEDRSLSAVLSAELSKT 803 



Search completed: September 29, 2004, 18:13:54 
Job time : 117.067 secs
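
The "Best Local Similarity" figure printed under each SQ line can be cross-checked against the counts on the Matches line. For the alignments listed above, it agrees with Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). This relationship is inferred from the printed numbers, not taken from GenCore documentation, and the function name below is hypothetical. A minimal sketch:

```python
import re


def best_local_similarity(stats_line: str) -> float:
    """Matches as a percentage of all alignment columns, to one decimal.

    Assumption: columns = Matches + Conservative + Mismatches + Indels.
    This is inferred from the report's figures, not from GenCore docs.
    """
    # Pull the named counts out of a line such as
    # "Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20;"
    counts = {name: int(value)
              for name, value in re.findall(r"(\w+)\s+(\d+)", stats_line)}
    columns = (counts["Matches"] + counts["Conservative"]
               + counts["Mismatches"] + counts["Indels"])
    return round(100.0 * counts["Matches"] / columns, 1)


# Statistics line of RESULT 12 (reported Best Local Similarity: 75.9%)
print(best_local_similarity(
    "Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20;"))
```

The "Gaps" count is parsed but not used, since it counts gap runs rather than gapped columns.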



GenCore version 5.1-6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 29, 2004, 18:09:33 ; Search time 28.8717 Seconds 

(without alignments) 
2079.581 Million cell updates/sec 

Title: US-09-830-972-2

Perfect score: 5848 

Sequence: 1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKIPGLKRKAD 1163

Scoring table: BLOSUM62 

Gapop 10.0, Gapext 0.5

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
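
The run parameters above name a Smith-Waterman ("sw") model with a gap-open penalty of 10.0 and a gap-extension penalty of 0.5. As a rough illustration of how such a local alignment score is computed, here is a minimal affine-gap Smith-Waterman (Gotoh) sketch. It is not GenCore's implementation. A flat match/mismatch score stands in for BLOSUM62, and the sketch assumes that a gap of length k costs gap_open + (k - 1) * gap_ext, a convention the report does not state. All names are hypothetical.

```python
def sw_affine_score(a, b, match=2.0, mismatch=-1.0,
                    gap_open=10.0, gap_ext=0.5):
    """Best local alignment score of a and b with affine gap penalties.

    Assumptions (not from the report): flat match/mismatch scoring instead
    of BLOSUM62, and a gap of length k costing gap_open + (k - 1) * gap_ext.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # H: best score of an alignment ending at (i, j)
    # E: best score ending with a gap in a (b[j-1] aligned to a gap)
    # F: best score ending with a gap in b (a[i-1] aligned to a gap)
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Extend an existing gap or open a new one
            E[i][j] = max(E[i][j - 1] - gap_ext, H[i][j - 1] - gap_open)
            F[i][j] = max(F[i - 1][j] - gap_ext, H[i - 1][j] - gap_open)
            pair = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero
            H[i][j] = max(0.0, H[i - 1][j - 1] + pair, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


seq = "ACDEFGHIKLMNPQRSTVWY"
# Inserting two residues forces one gap of length 2: 20 * 2 - (10 + 0.5)
print(sw_affine_score(seq, seq[:10] + "GG" + seq[10:]))
```

Swapping the flat `match`/`mismatch` pair for a substitution-matrix lookup would bring the sketch closer to the configuration named in the header, although it would still not be expected to reproduce the report's scores exactly.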

Database : Issued_Patents_AA:*

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep:*

2: /cgn2_6/ptodata/2/iaa/5B_COMB.pep:*

3: /cgn2_6/ptodata/2/iaa/6A_COMB.pep:*

4: /cgn2_6/ptodata/2/iaa/6B_COMB.pep:*

5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:*

6: /cgn2_6/ptodata/2/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
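
The "Query Match" percentages in this report are consistent with each result's score divided by the "Perfect score" of 5848 given in the run header, rounded to one decimal place. That reading is inferred from the printed figures rather than from GenCore documentation, and the function name is hypothetical. A small sketch of the check:

```python
PERFECT_SCORE = 5848  # "Perfect score" from the run header


def query_match(score: float, perfect_score: float = PERFECT_SCORE) -> float:
    """Score as a percentage of the perfect (self-match) score.

    Assumption: this is how the report's "Query Match" column is derived.
    """
    return round(100.0 * score / perfect_score, 1)


# Scores of the first five summary rows
for score in (908, 789.5, 716.5, 685, 671):
    print(score, query_match(score))
```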



SUMMARIES 

                     %
Result             Query
   No.    Score    Match  Length  DB  ID                    Description

     1      908     15.5     199   2  US-08-700-607-1       Sequence 1, Appli
     2    789.5     13.5     776   2  US-08-700-607-5       Sequence 5, Appli
     3    716.5     12.3     356   2  US-08-700-607-6       Sequence 6, Appli
     4      685     11.7     208   2  US-08-700-607-7       Sequence 7, Appli
     5      671     11.5     267   2  US-08-700-607-8       Sequence 8, Appli
     6    539.5      9.2     168   4  US-09-149-476-563     Sequence 563, App
     7      519      8.9     241   2  US-08-700-607-3       Sequence 3, Appli
     8    302.5      5.2    8991   4  US-08-714-741-32      Sequence 32, Appl
     9      288      4.9    2468   4  US-09-976-594-726     Sequence 726, App
    10      286      4.9      92   4  US-09-149-476-411     Sequence 411, App
    11    279.5      4.8    1786   3  US-08-973-462-8       Sequence 8, Appli
    12    265.5      4.5    1780   1  US-08-769-309A-5      Sequence 5, Appli
    13    265.5      4.5    1780   3  US-08-994-570-5       Sequence 5, Appli
    14      265      4.5    2137   4  US-09-134-001C-4463   Sequence 4463, Ap
    15    256.5      4.4      75   4  US-09-621-976-4600    Sequence 4600, Ap
    16    256.5      4.4      75   4  US-09-621-976-4601    Sequence 4601, Ap
    17    254.5      4.4    1596   4  US-08-978-277A-4      Sequence 4, Appli
    18      233      4.0    1142   2  US-08-993-118-7       Sequence 7, Appli
    19      233      4.0    1142   3  US-08-845-528C-7      Sequence 7, Appli
    20      233      4.0    1142   4  US-09-066-281B-7      Sequence 7, Appli
    21      233      4.0    1142   4  US-09-468-433C-7      Sequence 7, Appli
    22    232.5      4.0    1805   1  US-07-853-913-2       Sequence 2, Appli
    23      230      3.9    3256   4  US-09-919-172-98      Sequence 98, Appl
    24      230      3.9    3256   4  US-09-976-594-22      Sequence 22, Appl
    25      228      3.9    1142   3  US-09-061-709-2       Sequence 2, Appli
    26      228      3.9    1142   4  US-09-899-651-2       Sequence 2, Appli
    27      228      3.9    1142   4  US-09-392-714-26      Sequence 26, Appl


28 


227 


3 


. 9 


1270 


4 


us- 


07- 


757- 


022B-44 


Sequence 


44, Appl 


29 


227 


3 


.9 


1311 


4 


us- 


07- 


757- 


022B-42 


Sequence 


42, Appl 


30 


227 


3 


. 9 


1313 


4 


us- 


07- 


757- 


022B-142 


Sequence 


142, App 


31 


227 


3 


. 9 


1314 


4 


us- 


07- 


757- 


022B-50 


Sequence 


50, Appl 


32 


227 


3 


.9 


1320 


4 


us- 


07- 


757- 


022B-46 


Sequence 


4 6, Appl 


33 


227 


3 


. 9 


1320 


4 


us- 


07- 


-757- 


-022B-60 


Sequence 


60, Appl 


34 


227 


3 


. 9 


1354 


4 


us- 


-07- 


-757- 


-022B-48 


Sequence 


4 8, Appl 


35 


227 


3 


.9 


1361 


4 


us- 


■07- 


-757- 


-022B-40 


Sequence 


4 0, Appl 


36 


227 


3 


. 9 


1363 


4 


us- 


-07- 


-757- 


-022B-52 


Sequence 


52, Appl 


37 


227 


3 


. 9 


1404 


4 


us- 


-07- 


-757- 


-022B-2 


Sequence 


2, Appli 


38 


227 


3 


. 9 


1404 


4 


us- 


-07- 


-757- 


-022B-62 


Sequence 


62, Appl 


39 


225 


3 


. 8 


688 


3 


us- 


-09- 


-141- 


-047-8 


Sequence 


8, Appli 


40 


224.5 


3 


.8 


941 


4 


us- 


-07- 


-757- 


-022B-14 


Sequence 


14, Appl 


41 


224.5 


3 


. 8 


1022 


4 


us- 


-07- 


-757- 


-022B-84 


Sequence 


84, Appl 


42 


224 . 5 


3 


. 8 


1038 


4 


us- 


-07- 


-757- 


-022B-74 


Sequence 


74, Appl 


43 


224.5 


3 


. 8 


1049 


4 


us- 


-07- 


-757- 


-022B-58 


Sequence 


58, Appl 


44 


224.5 


3 


. 8 


1140 


4 


us- 


-07- 


-757- 


-022B-104 


Sequence 


104, App 


45 


224.5 


3 


.8 


1346 


2 


us- 


-08- 


-635- 


-121-2 


Sequence 


2, Appli 



ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

Sequence 1, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 199 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: 
CLONE: Consensus 

US-08-700-607-1 



Query Match 15.5%; Score 908; DB 2; Length 199; 

Best Local Similarity 96.3%; Pred. No. 1.1e-50; 

Matches 184; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 



Qy 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 
| | | | M 1 1 1 II 1 1 1 1 M 1 II 1 1 1 1 1 M M 1 1 M 1 1 1 1 1 1 1 1 1 I i i 1 1 i M 1 1 II i M 1 
Db 9 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYI7U^ALLSVTISFRIYKGVI 68 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 
| | | | | | || | | | I I I I 1 1 1 1 II M 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M II 1 II 
Db 69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERH 1152 
| | | || | | | 1 1 1 1 II 1 1 1 1 II 1 M 1 : 1 1 1 II 1 1 1 1 M 1 M 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 
Db 129 AVTjMWVFTWGALFNGLTLLILALISLFSVPVIYERHQAQID 188 




Qy 1153 AKIPGLKRKAD 1163 

I I I I I I I I I I : 
Db 189 AKIPGLKRKAE 199 
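
The statistics lines of the results in this report are consistent with "Best Local Similarity" being the number of identical positions divided by the total number of alignment columns (matches + conservative + mismatches + indels). A minimal sketch under that assumption; the function name `best_local_similarity` is hypothetical and is not part of GenCore:

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent of alignment columns that are identical matches.

    Assumption: the denominator is the sum of the four reported
    column counts, and the report rounds to one decimal place.
    """
    columns = matches + conservative + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return round(100.0 * matches / columns, 1)


# Counts copied from the statistics lines of RESULT 1 and RESULT 2.
print(best_local_similarity(184, 3, 4, 0))
print(best_local_similarity(243, 115, 252, 169))
```

The counts passed in are the Matches, Conservative, Mismatches and Indels values printed for RESULT 1 and RESULT 2.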



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION : 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 



TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 776 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307307 
US-08-700-607-5 

Query Match 13.5%; Score 789.5; DB 2; Length 776; 

Best Local Similarity 31.2%; Pred. No. 3.1e-42; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 

Qy 487 PKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 546 
| : | | | : : : : I : I II : I : I : 
Db 65 PABOSP -- VAMF.T AS T GVAGVS S AMDHT F S T T S KD GEG SCYTSLI S 110 

Qy 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI -- VMEAPLNS 599 
| | : | : : I I : : I I I I I I I I I : : I I 
Db 111 )ICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGIESRGLFSSDSG 167 

Qy 600 -- LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 
: |: | : ||: :| : :| : :: | |: 
Db 168 

Qy 642 1GIKEPE SFNAAVQETEAPYISIACDLIKETKLSTE-PSP 690 
I : : M : : : I I I II | | : | : : : : I 
Db 228 


Qy 



691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSIPEV PQTQEEAVMLMK 745 



:|| I :: :|| : : I I I h : 

Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

Qy 746 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792 

I I I I : i I : I :: I : I I : I I : : 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKAR3GPPT 394 

Qy 793 I PTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI 827 

II: ||:: : I I : : I I : 

Db 395 IPSPLDHEASSAESGDSEIELVSEDPM--AAEDALPSGYVSFGHVGGPPPSPASPSIQYS 452 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 879 

: I I II I : I I I I : : I I I : : : : I I = 

Db 453 ILREEREAELDS ELI I E S CDAS SAS EES P KREQDS PPMKP SALDAI REETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

II II 11:111 I I 1111 = 11 
Db 510 APSRRGLAEPG — SFLD-YPSTEPQPGPEL PPGDGALEPETPM 549 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984 

M | :: | : : : I : I : I I II I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 IKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1044 

I | : | | : I I I : I I I III I I : I I I I I : I I I II I II I II II I : I I : I I : I I I I I I 
Db 598 IKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPF 657 

Qy 1045 RAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

: | | I I | : : | : | : I I I : : : I I II : I II I I I I I I I I I I I II I I I II I : I I I I I 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I I : : I : : I : I : : I I : I : II III I I I I : : I M I II I I I II I : 

Db 718 LFNGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 776 



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 



OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 356 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307309 
US-08-700-607-6 

Query Match 12.3%; Score 716.5; DB 2; Length 356; 

Best Local Similarity 46.1%; Pred. No. 4.6e-38; 

Matches 164; Conservative 48; Mismatches 89; Indels 55; Gaps 7; 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSGADS 882 

: I I II 1:11 I I : : I I I : : : : I I : I I 

Db 36 EEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAIREETGVRAEERAPS 92 

Qy 883 LPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSK 942 

1111:111 I I Nihil 
Db 93 RRGLAEPG--SFLD-YPSTEPQPGPEL PPGDGALEPETPM 129 

Qy 943 SLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRDIKK 987 

|| | : : | : : : hi : I I I I II I I I : 

Db 130 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRDIKQ 180 

Qy 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

| | : I I I : I II I II Ihlll I I : I I I II I I I I I I I I • I : I I : I I : I I II I I : I I 
Db 181 TGIVFGSFLLLLFSLTQFSWSWAYIAJLAALSATISFRIYKSVLQAVQKTDEGHPFKAY 240 

Qy 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

|| I : : I : I .: I I I : : : I I I I : I I I I I I I I I I I I I M I I I I I I I : I I I I M I I 

Db 241 LELEITLSQEQIQKYTDCLQFYWSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFN 300 

Qy 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I | I I I : : I : : I : I : : I I : I : I I III I I I I : : I I I I I I II I II I : 
Db 301 GLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPG7VKRHAE 356 



RESULT 4 
US-08-700-607-7 

Sequence 7, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 208 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307311 
US-08-700-607-7 

Query Match 11.7%; Score 685; DB 2; Length 208; 

Best Local Similarity 67.0%; Pred. No. 2.2e-36; 

Matches 128; Conservative 32; Mismatches 31; Indels 0; Gaps 0; 



Qy 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 
I • : I I I ! M I I I • I I • I I I - • -ill t r i • 
Db 18 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 77 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 
II • I i • I I I I I l • I i I 1 I • • I • I -Ml" • i I I I • I I I I I I ti 
Db 78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152 
Mill- M M II II M I I I " I • • I • I • • M • i -ii • - i i i i 
Db 138 AVTJytWLLTWGALFNGLTLLLMAWSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQ 197 

Qy 1153 AKIPGLKRKAD 1163 
I I II I II I : 
Db 198 AKIPGAKRHAE 208 



RESULT 5 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 267 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 

Query Match 11.5%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 2.5e-35; 

Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0; 

Qy 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 

I : : I I II I I I I I : I I : I M : I II III I I : II I I I : I I I II I I I I I I I I I : 
Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 68 



Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 
i i • I I • I I I l l I • i i i i i * ■ i • i * ' ii"" * i f i i t i i i i i f 
Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152 
Db 

Qy 1153 AKIPGLK 1159 
Db 189 AKIPGAR 195 



RESULT 6 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 
; CURRENT FILING DATE: 1998-09-08 
; EARLIER APPLICATION NUMBER: PCT/US98/04493 
; EARLIER FILING DATE: 1998-03-06 

EARLIER APPLICATION NUMBER: 60/040,162 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,502 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,617 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,618 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,503 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 



; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,578 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/043,576 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/047,501 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,670 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/056,632 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,664 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,876 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,881 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,909 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,875 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,862 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,887 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,908 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/048,964 

; EARLIER FILING DATE: 1997-06-06 

; EARLIER APPLICATION NUMBER: 60/057,650 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/056,884 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/057,669 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

EARLIER APPLICATION NUMBER: 60/061,060 

; EARLIER FILING DATE: 1997-10-02 

Query Match 9.2%; Score 539.5; DB 4; Length 168; 

Best Local Similarity 59.3%; Pred. No. 3.2e-27; 

Matches 99; Conservative 36; Mismatches 31; Indels 1; Gaps 1; 

Qy 998 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 1057 

: I I I I I I : : I I : i : I 1 I I I I I I I I I I I I I I I I : I I I : I I I I I : I I I : : : : I I 
Db 1 MLLSLAAFSVISWSYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSE 60 

Qy 1058 LVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 1117 

I I : I : hi : I : I II I I : I i I t I t I II I I : I I I I I : I 1 I : I I I I I I : 
Db 61 AFHNYMNAAMV1IINRALKLIIRLFLVEDLVDSLKIAVFMWLMTYVGAVFNGITLLILAEL 120 



Qy 1118 SLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGL-KRKAD 1163 

: I I : I : : I I : : : I I I i I : I : I I : I I I i I : I I : I : I I : 

Db 121 LIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 167 



RESULT 7 
US-08-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 



APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-3 

Query Match 8.9%; Score 519; DB 2; Length 241; 

Best Local Similarity 59.4%; Pred. No. 1.1e-25; 

Matches 95; Conservative 33; Mismatches 32; Indels 0; Gaps 0; 



Qy 972 SKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031 
I : I II: I I I : I I I I III : I Mill I I :: I I : I : I I I I I I II I II I II I 
Db 44 SSCAVHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRIYKSV 103 



Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091 

II I : I I I : I I I I I : I I I : : : : 1 I I I : I : I : I : I : I I I I I : I I I I I I I 

Db 104 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLK 163 

Qy 1092 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQV 1131 

II ||: I I I I I : I I I : I I I I I I : : I : I : : I : : : I 
Db 164 LAVFMWLMTYVGAVFNGITLLILAELLIXSVPIVYXKYKV 203 



RESULT 8 

US-08-714-741-32 

Sequence 32, Application US/08714741 
Patent No. 6500613 
GENERAL INFORMATION: 

APPLICANT: Briles, David E. 
APPLICANT: McDaniel, Larry S. 
APPLICANT: Swiatlo, Edwin 
APPLICANT: Yother, Janet 
APPLICANT: Crain, Marilyn J. 
APPLICANT: Hollingshead, Susan 
APPLICANT: Tart, Rebecca 
APPLICANT: Brooks-Walter, Alexis 

TITLE OF INVENTION: PNEUMOCOCCAL GENES, PORTIONS THEREOF, 

TITLE OF INVENTION: EXPRESSION PRODUCTS THEREFROM, AND USES OF SUCH GENES, 
TITLE OF INVENTION: PORTIONS AND PRODUCTS 
NUMBER OF SEQUENCES: 47 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Curtis, Morris & Safford, P.C. 
STREET: 530 Fifth Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S. 
ZIP: 10036 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/714,741 
FILING DATE: 16-SEP-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Frommer Esq., William S. 
REGISTRATION NUMBER: 25,506 
REFERENCE/DOCKET NUMBER: 454312-2460 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 840-3333 
TELEFAX: (212) 840-0712 
INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 8991 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



MOLECULE TYPE: amino acid 
US-08-714-741-32 

Query Match 5.2%; Score 302.5; DB 4; Length 8991; 

Best Local Similarity 22.1%; Pred. No. 1.6e-09; 

Matches 244; Conservative 125; Mismatches 439; Indels 297; Gaps 47; 

Qy 18 PRPPPAFKYQFWEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAP 77 

1:111: : I I : I : : I I I I II I : : Mi I 

Db 7805 PKPAPAPQPAPAPKPEKTDDQQAEEDYARRSE -- EEYNRLPQQQ PPKAEKP 7853 

Qy 78 LLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPP 137 

: I I I : I I I I I I I I I I I I I I I I 

Db 7854 APAPKPEQPVPAEXPENP APAPK — PAXAPQPLKPEEPAEQP 7893 

Qy 138 ARPPPPPPAGASPLAEPAAPPSTPAAP — KRRGSGSVDETLFALPAASEPVI PSSAEKIM 195 

I IN III I III: 1:1 I : I I 

Db 7894 KPEKPEEPAGQ PEPEKPDDQQAGEDYARRSGGEYNRFPQQQPPKAEK — PAPAPK — 7946 

Qy 196 DLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSF KEHGYLGNLSAVSS 248 

III I II : I : I : I I III: 

Db 7947 — PEQP VPAPKTLLKKAKLAGAKSKAATKKAELEPELEKAEAELENLLSTLD 7996 

Qy 249 SEGTIEETLNEASKELPERATNPFVNRDLAEFSEL — E YS EMGS S FKGS PKGES AI L VEN 306 

II : I : I I I I II : I I I I I : : : I : : : 

Db 7997 PEG KTQDELDKEAAEAELNKKVEALPNQVSELEEELSKLEDNLKDAETNNVEDYIKE 8053 

Qy 307 TKEEVIVRSK DKEDLVCSAALHSPQES PVGKEDRWS PEKTMDI FNEMQMS WAPVR 363 

II I : : I II I : : I I I : II. I I 

Db 8054 GLEEAIATKQAELEKTPKELDAALN — ELGPDGDEEETPPPE APAE 8097 

Qy 364 EEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED — SLEQKSLGKDSEGRN 421 

: : I I I : I I I I I I : I : I I : I : 

Db 8098 QPKPE-KPAEETPAPAPKPEKSADQQA EEDYARRSEEEYNRLTQQQPPKAEKPAP 8151 

Qy 422 EDASFPSTPEPVKDSSRAYITCAS FTSA TESTTANTFPLLEDHT 465 

I I I I Ml I II II 

Db 8152 APAPKPEQPAPA-PKSRGLATKKKLNLAEARIELLLKKLGLEPGLEKAGAGLGNLLSTLD 8210 

Qy 466 SENKTDE KKIEERKAQI — ITEKTSPKTSNPFLVAVQDSEADYVTTDT 511 

III: I I : I | : : | : | | : : | : | : : | 

Db 8211 PEGKTQDELDKEAAEAELNKKVEALPNQVAELEEELSKLEDN LKDAETNHVEDYI 8265 

Qy 512 LSKVTEAAVSNMPE GLTPDLVQE ACESELNEATGTKIAYET 552 

: I I : I I I I : I I I : : I I I I 

Db 8266 KEGLEEAIATKQAELEKTPKELDAALNELGPDGDEEETPAPEAPAEQPKPEKPAEETPAP 8325 

Qy 553 KVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAP 596 

: I : I I I I I I I : I : I I : II 

Db 8326 APKPEKSADQQAEEDYARRS EEEYNRL — TQQQPPKAEKPAPAPAP-KPEQPAPAPKKKQ 8382 

Qy 597 LNSLLPSA GASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAM 641 

I :: I I II::: III : I I I I I : 

Db 8383 KWLENLLSTLDPGGKTQDELDKGAAEAELNKKVEALPNPVX ELEEELSPPEDN — 8436 

Qy 642 NVALKALGT KEGIKE PES FN AAV QETEAPYIS 673 

M I I I I : : I I : : I I : : I I I I 



Db 8437 LKDAETNHVEDYIKEGLEEAIATKQAELEETPQEVDAALNDLVPDGGEEETPAP 8490 

Qy 674 IACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE LVEDSSPESE 720 

: : : I : I : : I I I I : I I I : I : : I 

Db 8491 APQPDEPAPAPAPNAEQPAPAPKPEKSADQQAEEDYARRSEGEYNRLTQQQPPKAE 8546 

Qy 721 -PVDLFSDDSIPEVPQTQEEAVML MKESLTEVSETVAQHKEERLSASPQ 768 

I : I : I I : : : : | | : | : | : : | 

Db 8547 KPAPAPAPKPEQPAPAPNKEIARLQSDLKDAEENNVEDYIKEGLEQAITNKKAELATTQQ 8606 

Qy 769 ELGKPYLESFQPNLHSTKDAASNDIPTLTKKE KISLQMEEFNTAI YSNDDLL 820 

: I : I I I: I I : I :::::{ : :: | 

Db 8607 NIDKTQKDLEDAELELEKVLATLDPEGKTQDELDKEAAEAELNEKVEALQNQVAELEEEL 8666 

Qy 821 SSKEDKIKESETFSDSSPI-EIIDEFPTFVSAKDDSPKIAKEYTDLEVSDKSEIANIQSG 879 

I I I : I : : I I : I I : : I : I : : I I : I 
Db 8667 SKLEDNLKDAETNNVEDYI KEGLEE AIATKKAELEKTQKE 8706 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

I : : I : : I I : I I I I : I 
Db 8707 LDAALNELGPDGD EEETPAPAPQPEKPAEEPEN-PAPAPKPE 8747 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLS 964 

II : : I I : : I : I I : 

Db 8748 --KSADQQAEEDYARRSEEEYNRLT 8770 



RESULT 9 

US-09-976-594-726 

; Sequence 726, Application US/09976594 

; Patent No. 6673549 

; GENERAL INFORMATION: 

; APPLICANT : Furness, Michael 

; APPLICANT : Buchbinder, Jenny 

; TITLE OF INVENTION: GENES EXPRESSED IN C3A LIVER CELL CULTURES TREATED WITH STEROIDS 

; FILE REFERENCE: PA-0041 US 

; CURRENT APPLICATION NUMBER: US/09/976,594 

; CURRENT FILING DATE: 2001-10-12 

; PRIOR APPLICATION NUMBER: 60/240,409 

; PRIOR FILING DATE: 2000-10-12 

; NUMBER OF SEQ ID NOS : 1143 

; SOFTWARE : PERL Program 

; SEQ ID NO 726 

; LENGTH: 2468 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY: misc_feature 

OTHER INFORMATION: Incyte ID No. 6673549 4295277CD1 
US-09-976-594-726 

Query Match 4.9%; Score 288; DB 4; Length 2468; 

Best Local Similarity 20.2%; Pred. No. 2e-09; 

Matches 251; Conservative 166; Mismatches 441; Indels 386; Gaps 56; 
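
The "Best Local Similarity" percentage printed with each result appears to be the identity count divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels together). This is an inference from the printed counts, not a definition given in the listing; the function name below is hypothetical. A minimal Python sketch under that assumption:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are exact matches.

    Assumed definition, inferred from the counts in this listing:
    columns = matches + conservative + mismatches + indels.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts printed for the result directly above this sketch.
print(round(best_local_similarity(251, 166, 441, 386), 1))
```

With the counts above this gives 20.2, the value printed in the listing; the same formula applied to the counts of the other results in this section also agrees with their printed percentages.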



Qy 13 STDSPPR--PPPAFKYQFVTEPEDEEDEEE EED EEEDDED 50 



Db 625 ATDVKPKAAKEKTVKKETKVKPEDKKEEKEKPKKEVAKKEDKTPIKKEEKPKKEEVKKEV 684 

Qy 51 LEELEVLERKPAAGLSAAAVP PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWER 110 

:|:: |:| | : : : I : : ::: |: 

Db 685 KKEIKKEEKK EPKKEVKKETPPKEVKKEVKKEEKKEVKKE— EK 726 

Qy 111 SPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRG-- 168 

I III : | | : | I I I I : I II : I 

Db 727 EPKKEIKKLPKDAKKSSTPLSEAKKPAALKPKVPKKEESVKKDSVA AGKPKEKGKI 782 

Qy 169 SGSVDETLFAL PA AS E P VI P S S AEKIMDLME 199 

I I : I II I : : I : I I 

Db 783 KVIKKEGKAAEAVAAAVGTGATTAAVMAAAGIAAIGPAKELEAERSLMSSPEDLTKDFEE 842 

Qy 200 QPGNTVSSGQEDFPSV-LLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLN 258 

I : : | : | : | I I : : : : I I I I I I 

Db 843 LKAEEVDVTKDIKPQLELIEDEEKLKETEPVEAYVIQKEREVTKGPAESPDEG-ITTTEG 901 

Qy 259 EASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTK-EEVIVRSKD 317 

I I I : : : : I : I : I : I I : I : I I : I 

Db 902 EGECEQTPEELEPVEKQGVDDIEKFE — DEGAGFEES — SETGDYEEKAETEEAEEPEED 957 

Qy 318 KEDLVC-SAALHSP QES PVGKEDRVVS PEKTMDI FNEMQMSV 358 

I : II I I : I I I : I I : I I I : II I 
Db 958 GEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRA EEDMD EAIEKG 1011 

Qy 359 VAPVREEYADFKPFEQAWEVK-DTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410 

I II II: :: I : : : I I : : I I : I I : I : 

Db 1012 EAEQSEEEADEE — DKAEDAREEEYEPEK — MEAEDYVMAWDKAAEAGGAEEQYGFLTT 1067 

Qy 411 — KSLGKDSEGR NEDASFPSTPEPVKDSSRAYITCAS FTSATESTT 454 

I I I I I I : | : | | : | : : : I : I : 

Db 1068 PTKQLGAQSPGREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEIS 1127 

Qy 455 ANTFPL LEDHTSENKTDEK KIEERKAQIITEKTS-PKTSNPFLVAV 499 

: I: : | I: :|: I ::::::: | | | 

Db 1128 SEPTPMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPAVDVTPLNGFS 1187 

Qy 500 QDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDL 556 

: I : I II I : : I : I : I I : I : : : I : I : 
Db 1188 EGSKTD--ATDGKDYNASASTISPPSSMEEDKFSRSALRDAYCSEVKASTTLDIKDS 1242 

Qy 557 — VQTSEAIQESLYPTAQLCP SFE EAEATPSPVLPDIV 592 

: I I : I I : I : I i I I : I I I : : 

Db 1243 ISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFSLTPNEIKVSAEAEVAPVSPEVT 1302 

Qy 593 ME APLNSLLPSAG ASVVQ 610 

I : I I : I I I I :: 

Db 1303 QEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSPTDEKSSHLPTEVIEKPPAVPVS 1362 

Qy 611 PSVSPLEAPPPVSYDSIK— LEPENPPP YEEAMNVALKALG 649 

I I I I : : I I I I : II II II : : III 

Db 1363 FEFSDAKDENERASVS PMDEPVPDSESPIEKVLSPLRSPPLIGSESAYESFLSADDKASG 1422 

Qy 650 TKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPD-FSNYSEIA KF 701 

: I : I : II : I I I I I I : I 



Db 



1423 



RGAESPF EEKS GKQGS PDQVS P VS EMT S T S L YQDKQ 1458 



Qy 702 EKSVPEHAELVEDSSPE SEPVDLFSDDSIPEVPQTQ EEAV 741 

I : I : i I I I : I : : : I I I | : 

Db 1459 EGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPTQIDVSQFGSFKEDTK 1518 

Qy 742 MLMKE SLTEVSETVAQ HKEERLSASPQELG KPYLESFQPNLHSTK 786 

I : I I I I I I I : II II : : I : I : I I 
Db 1519 MSISEGTVSDKSATPVDEGVAEDTYSHMEGVASVSTASVATSSFPEPTTDDVSPSLH 1575 

Qy 787 DAASNDIPTLTK-KEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEF 845 

I I I : : : I : : : I : : : I I I : : 
Db 1576 — AEVGSPHSTEVDDSLSVSWQTPTT-FQETEMSPSKEECPR 1615 

Qy 846 PTFVSAKDDSPKLAKEYTDLE — VSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

I : I I I I I II I : : I : : I : : I : I : I I I II:: 

Db 1616 PMSISPPDFSPKTAKSRTPVQDHRSEQSSM-SIEFGQES-PEQSLAMDFSRQSPDHPTVG 1673 

Qy 897 IYPKDE-VHVSDEFSENRSSVSKASISPSNV 926 

I I : I : I III: III 

Db 1674 AGVLHITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTS 1733 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE 970 

II I : : : I : : : I : I : I I I I I :: I 

Db 1734 SAHTP-SQIASPLQEDTLSDVAPPR DMSLYASLTSE 1768 



RESULT 10 
US-09-149-476-411 

; Sequence 411, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 

; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/04493 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,626 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,334 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,336 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,163 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/047,600 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,615 

; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 



EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 



EARLIER FILING DATE: 1997-10-02 



Query Match 4.9%; Score 286; DB 4; Length 92; 

Best Local Similarity 56.0%; Pred. No. 2.2e-11; 

Matches 51; Conservative 21; Mismatches 19; Indels 0; Gaps 0; 

Qy 1064 NSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIP 1123 

I : I : 1:1 : I : I I I I I : I I I I I t I II I 1 : I I I I I : I I I : I I I I I I : : I I : I 
Db 2 NAAMV11INRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAELLIFSVP 61 

Qy 1124 VIYERHQVQIDHYLGLANKSVKDAMAKIQAK 1154 

: : I I : : : I I I I I : I : I I : I I : I 

Db 62 IVYEKYKTQIDHYVGIARDQTKSIVEKIPSK 92 



RESULT 11 
US-08-973-462-8 

; Sequence 8, Application US/08973462B 
; Patent No. 6191270 
; GENERAL INFORMATION: 
; APPLICANT: DRUILHE, PIERRE 
; APPLICANT: DAUBERSIES, PIERRE 
; TITLE OF INVENTION: MALARIAL PRE-ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 
; FILE REFERENCE: 0660-0125-0 PCT 
; CURRENT APPLICATION NUMBER: US/08/973,462B 
; CURRENT FILING DATE: 1998-02-06 
; EARLIER APPLICATION NUMBER: PCT/FR96/00894 
; EARLIER FILING DATE: 1996-06-12 
; EARLIER APPLICATION NUMBER: FR 95/07007 
; EARLIER FILING DATE: 1995-06-13 
; NUMBER OF SEQ ID NOS: 29 
; SOFTWARE: PatentIn Ver. 2.0 
; SEQ ID NO 8 
; LENGTH: 1786 
; TYPE: PRT 
; ORGANISM: Artificial Sequence 
; FEATURE: 
; OTHER INFORMATION: Description of Artificial Sequence: Polypeptide 
US-08-973-462-8 

Query Match 4.8%; Score 279.5; DB 3; Length 1786; 

Best Local Similarity 20.1%; Pred. No. 4.4e-09; 

Matches 261; Conservative 220; Mismatches 518; Indels 297; Gaps 60; 

Qy 33 EDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDS VP 87 

I I : II III : I : : II II I I : : I : I I I I I 

Db 225 EKVEESVEENDEESVEENVEE-NVEENDDG SVASSVEESIASSVDESIDSSIEENVA 280 

Qy 88 P APRGPLPAAPPAAPERQPSWERSPA APAPSLPPAAAV-LPSKLPEDDE 135 

I II II : : | | I : | : : : | : : | : | 

Db 281 PTVEEIVAPSVVESVAPSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAENVE 340 

Qy 136 PPARP P P P P PAGAS P LAE P AAP P S T PAAP KRRGS GS VDET L FAL PAAS E PVI P S 189 

I I : I : I I : : III: : : I I I 

Db 341 EIVAPTVEEIVAPTVEEIVAPSVVESVAPSVEESVEENVEESVAENV--EESVAENVEES 398 

Qy 190 SAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASL— PSLSPLSTVSFKEHGYLGNLSAVS 247 



11*" I II' • • I I - - I I I 

Db 399 VAENVEESVAE -- N VE E S VAENVE E I VAP T VE E I VAP T VE E I VAP S VVE SVAP 449 

Qy 248 SSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENT 307 

I I :: I I : I : I I : I : I I I I I : I I 

Db 450 SVEESVEENVEESVAENVEESVAENVEES VAENVE ES -- VAENV 491 

Qy 308 KEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVAPVREEYA 367 

= 11 : : : I : I : : I I : : I : I I : : : I I I | | 

Db 492 EESV AENVEESVA EN VE E S VAENVE E I VAP--T VE EIVAPTVEEIV 535 

Qy 368 DFKPFEQ-AWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGRN-EDAS 425 

I I I : : : I : : I I I | | : I : I : : : : I | | : 

Db 536 APSVVESVAPSVEESVEENVEESVAE-NVEESVAEN-VEESVAENV--EESVAENVEEIV 591 

Qy 426 FPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITE 485 

I : I I : : III I : I : II : I I I : : | 

Db 592 APTVEEIVAPTVEEIVA PSVVESVA PSVEESVEEN VEES VAENVE E 637 

Qy 486 KTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATG 545 

= I : : I I I I : : | : | I : I : I : | : 

Db 638 SVAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSVVESVAPS-VEESVEENVEESVA 696 

Qy 546 TKIAYETKVDLVQTS--EAIQESLYPTAQ--LCPSFEEAEATPSPVLPDIVMEAPLNSLL 601 

: I : : I : I I :: I I : : : I : I I I I : : I I : : | : 
Db 697 ENVE-ESVAENVEESVAENVEESVAENVEEIVAPTVEEIVA PTVEEIVAPSVVESVA 752 

Qy 602 PSAGASV VQPSVSPLEAPPPVSYDSIKLE 630 

M M 11:1:11 :| 
Db 753 PSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAPTVEEIVAP SVE 804 

Qy 631 PENPPPYEEAM NVALKALGTKEGI KEPES FNAAVQETEAP YI S IACDLI K 680 

III:: I : I I I : : I : : I : : : : : : 

Db 805 ESVAPSVEESVAENVATNLSDNLLSNLLGGIETEEIKDSILNEIEEVKENVVTTILENVE 864 

Qy 681 ETKLSTEPSPDFSNYSEIAK FEKSVPEHAELV EDSSPESEPVDLF — 725 

II : I Mil: |: : | | | |:: | | :: 

Db 865 ET— TAESVTTFSNILEEIQENTITNDTIEEKLEELHENVLSAALENTQSEEEKKEVIDV 922 

Qy 726 SDDSIPEVPQTQEEAVMLMKESLTE VSETV 755 

I :: I I : : II : I :: I I I I 

Db 923 IEEVKEEVATTLIETVEQAEEKSANTITEIFENLEENAVESNENVAENLEKLNETVFNTV 982 

Qy 756 AQHKEERLSAS PQELGKP YLES — FQPNLHSTKDAASNDI PTLTK — KEKISLQMEE 808 

I I : I : I :: I : I I : : : : | : | | | 

Db 983 LDKVEETVEISGESLENNEMDKAFFSEIFDNVKGIQENLLTGMFRSIETSIVIQSEEKVD 1042 

Qy 809 FNTAIYSN — DDLLSSKEDKIKESETFSDSSPI-EIIDEF PTFVS 850 

I : I : I : : : I I : : I I : : I : I | : 

Db 1043 LNENVVSSILDNIENMKEGLLNKLENISSTEGVQETVTEHVEQNVYVDVDVPAMKDQFLG 1102 

Qy 851 AKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFS 910 

: : : I : : : I I III : I : I : : | : | : | : : | 

Db 1103 ILNEAGGLKEMFFNLEDVFKSE S DVI T VEEI KDE P VQKEV- EKETVSIIEEME 1154 

Qy 911 ENRSSV SKASISPSNVSALEPQTEMGSIVKSKSLT-KEAEKKLPSDTEK EDR 961 

M I I : Ml Ml I : : : | : | | : |: | 



Db 1155 ENIVDVLEEEKEDLTDKMIDAVEESIEISSDSKEETESIKDKEKDVSLVVEEVQDNDMDE 1214 

Qy 962 SLSAVL SAELSKT SV-VDLL YWRDI KKTGVVFGASLFLLLS LTVFS I VSVTAYI A 1015 

I : II | | | : | : : : : I : : I : | : : 
Db 1215 SVEKVLELKNMEEELMKDAVEINDITSKLIEETQELNEVEADLIKDME K 1263 

Qy 1016 LALLS VT I S FRI YKGVI QA IQK-SDEGHPFRAYLESEVA ISEELVQKYSN--S 1065 

Db 1264 LKELEKALS-EDSKEIIDAKDDTLEKVIEEEHDITTTLDEVVELKDVEEDKIEKVSDLKD 1322 

Qy 1066 ALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVI 1125 

: : I I : : I : : : : I : : : 

Db 1323 LEEDILKEVKEIKE--LESEILEDYK ELKTIETDIL 1356 

Qy 1126 YERHQVQIDHYLGLANKS--VKDAMAKIQAKIPGLK 1159 

Db 1357 EEKKEIEKDHFEKFEEEAEEIKDLEADILKEVSSLE 1392 



RESULT 12 
US-08-769-309A-5 

; Sequence 5, Application US/08769309A 
; Patent No. 5741890 
; GENERAL INFORMATION: 
; APPLICANT: Scott, John D., 
; APPLICANT: Nauert, Brian J., 
; APPLICANT: Klauck, Theresa M. 
; TITLE OF INVENTION: Protein Binding Domains of Gravin 
; NUMBER OF SEQUENCES: 24 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun 
; STREET: 6300 Sears Tower/233 South Wacker Drive 
; CITY: Chicago 
; STATE: Illinois 
; COUNTRY: United States of America 
; ZIP: 60606-6402 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/769,309A 
; FILING DATE: 
; CLASSIFICATION: 435 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Noland, Greta E. 
; REGISTRATION NUMBER: 35,302 
; REFERENCE/DOCKET NUMBER: 27866/33451 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 312-474-6300 
; TELEFAX: 312-474-0448 
; TELEX: 25-3856 
; INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 1780 amino acids 
; TYPE: amino acid 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-769-309A-5 

Query Match 4.5%; Score 265.5; DB 1; Length 1780; 

Best Local Similarity 20.8%; Pred. No. 3.4e-08; 

Matches 264; Conservative 146; Mismatches 486; Indels 371; Gaps 51; 

Qy 11 SSSTDSPPRPPPA FKYQFVTE PEDE EDEEEEEDEEED 47 

I I : I I I : I I : I I : ill I ::|:| |: I 

Db 277 SKSAESPTSPVTSETGSTFK-KFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVD 335 

Qy 48 DEDLEELEVLERKPAAGLSAAAVPPAAAA — PLLDFSSDSVPPAPRGPLPAAPPAAPERQ 105 

I : : I I II I I I : I I I : I II: I I 

Db 336 TEEDGKAEVASEKLTASEQAHPQEPAESAHEPRLSAEYEKV ELPS EEQ 383 

Qy 106 PSWERSPAAPAPSLPPAAAVLPSKLP EDDEPPARPPPPPPAGASP 150 

I : I : I : I I I I : I : : I I : 

Db 384 VSGSQGPSEEKPA-PLATEVFDEKIEVHQEEWAEVHVSTVEERTEEQKTEVEETAGSVP 442 

Qy 151 L AEP — AAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQP 201 

I I I I I I I I I : I I : I :: : I 

Db 443 AEELVGMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGVVSEVEML 502 

Qy 202 GNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLG NLSAVSSSEGTI 253 

: : | | | : | | | I : I I : : I : 

Db 503 SSQERMKVQGSPLKKLFTSTGLKKLS GKKQKGKRGGGDEESGEHTQVPADSPDSQ 557 

Qy 254 EETLNEASKELPERATN-PFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVI 312 

II I : I II : : | | | : : I I : : I I I : I : I 

Db 558 EEQKGESSASSPEEPEEITCLEKGLAEVQQDGEAEEGATSDGEKKREGVTPWAS FKKMVT 617 

Qy 313 VRSK DKE DLVCSAALHS PQES PVGKEDRVVS PEKTMDI FNEMQMSVVAPV 362 

: : III I I I I I I : : : : I | : | ! | 

Db 618 PKKRVRRPSESDKEDELDKVKSATLSSTEST ASEMQEEMKGSVEEPK 664 

Qy 363 REEYADFKPFEQAWEV KDTYEGS 385 

II : I I III: 

Db 665 PEEPKRKVDTSVSWEALICVGSSKKRARRRSSSDEEGGPKAMGGDHQKADEAGKDKETGT 724 

Qy 386 RDVLA ARANVESK VDRKCLEDSLEQKSLGKDS 417 

: I I I : I : M : I I : I I : I I 

Db 725 DGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKS — EDS 782 

Qy 418 -EGRNEDASFPSTPEPVKDSS RAY I T CAS FT SAT E — ST 453 

I : I I I II I : I : I : I I I I I 

Db 783 IAGSGVEHSTPDT-EPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTGANEDDSD 841 

Qy 454 TANTFPLLEDHTSENKTDE KKIEERKAQI ITEKTSPKTSNPFLVAVQD 501 

III I : I : : | : : | : : : : | : I I I 

Db 842 VPAVVPLSEYDAVEREKMEAQQAQKGAEQPEQKAATEVSKELSESQVHMMAAAVADGTRA 901 

Qy 502 SEADYVTTDTLSK VTEAAVSNMPEG- 526 

: I I : I : I : III : I I 

Db 902 ATIIEERSPSWISASVTEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEP LPENR 957 

Qy 527 LTPDLVQEA-CESELNEATGTKI-AYETKVDLVQTSEAIQESLYPTAQ 572 



Db 958 EARGDTVVSEAELTPEAVTAAETAGPLGSEEGTEASAAEETTEMVSAVSQLTDSPDTTEE 1017 

Qy 573 L CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVS 623 

I I I I II:: I I I I I : II 

Db 1018 ATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTG GPEDVLQPVQ 1069 

Qy 624 YDSIKLEPENPPPYEEA MNVALK ALGTKEGIKEPESFNAAVQE 666 

: I I I II : I II I I I I I I I I 

Db 1070 RAEAERPEEQAEASGLKKETDVVLKVDAQEAKTEPFTQGKVVGQTTPESFEKAPQV 1125 

Qy 667 TEAPYISIACDLIKETKLSTEPSPD — FSNYSEIAKFEKSVPEHAELVEDSSPES — EPV 722 

II: I : : : I I : I : I :: : I I : I I 

Db 1126 TES IESSELVTTCQAETLAGVKSQEMVMEQAIP PDSVETPT 1166 

Qy 723 DLFSDDSIP EVPQTQEEAVMLMKESLTEVSETVAQH KEERLSASPQELGKPY 774 

I : I I I : I I : I : : I : I I : I I : : I 

Db 1167 DSETDGSTPVADFDAPGTTQ KDEIVEIHEENEVHLVPVRGTEAEAVPAQKERPP 1220 

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

I II: : I : : I : : : I I : : I : : : : 

Db 1221 APSSFVFQEETKE QSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYA 1269 

Qy 835 DSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885 

I : : II I I III: I : : I I ! I I 

Db 1270 DEKTKDVPFFEGLEGSIDTGITVSREKVTEVALKGEGTEEAECKKDDALELQSHAKSPPS 1329 

Qy 886 LELPCDLSF KNIYPKDEVHVSDEFSENRSS VSKASISPSNVSALEPQTE 934 

I : : : | | : : | | : : : Ml: I I : : I 

Db 1330 PVEREMVVQVEREKTEAEPTHVNEEKLEHETAVTVSEEVSKQLLQTVNVPIIDGAKE 1386 

Qy 935 MGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVF-- 992 



Db 1387 VSSLEGSPPPCLGQEEAVCTKIQVQSSEASFTLTAAAEEEKV--LGETANILETGETLEP 1444 

Qy 993 -GASLFL 998 

I I I I 

Db 1445 AGAHLVL 1451 



RESULT 13 
US-08-994-570-5 

; Sequence 5, Application US/08994570 
; Patent No. 6090929 
; GENERAL INFORMATION: 
; APPLICANT: Scott, John D., 
; APPLICANT: Nauert, Brian J., 
; APPLICANT: Klauck, Theresa M. 
; TITLE OF INVENTION: Protein Binding Domains of Gravin 
; NUMBER OF SEQUENCES: 24 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun 
; STREET: 6300 Sears Tower/233 South Wacker Drive 
; CITY: Chicago 
; STATE: Illinois 
; COUNTRY: United States of America 
; ZIP: 60606-6402 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/994,570 
; FILING DATE: 
; CLASSIFICATION: 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Noland, Greta E. 
; REGISTRATION NUMBER: 35,302 
; REFERENCE/DOCKET NUMBER: 27866/33451 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 312-474-6300 
; TELEFAX: 312-474-0448 
; TELEX: 25-3856 
; INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 1780 amino acids 
; TYPE: amino acid 
; TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-994-570-5 

Query Match 4.5%; Score 265.5; DB 3; Length 1780; 

Best Local Similarity 20.8%; Pred. No. 3.4e-08; 

Matches 264; Conservative 146; Mismatches 486; Indels 371; Gaps 51 

Qy 11 SSSTDSPPRPPPA FKYQFVTE PEDE EDEEEEEDEEED 47 

I t : I I I : I I : I I : IN I : : | : | | : | 

Db 277 SKSAESPTSPVTSETGSTFK-KFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVD 335 

Qy 48 DEDLEELEVLERKPAAGLSAAAVPPAAAA--PLLDFSSDSVPPAPRGPLPAAPPAAPERQ 105 

I : : I I II I II : I I I : I | | : I [ 

Db 336 TEEDGKAEVASEKLTAS EQAHPQEPAESAHEPRLSAEYEKV ELPS EEQ 383 

Qy 106 PSWERSPAAPAPSLPPAAAVLPSKLP EDDEPPARPPPPPPAGASP 150 

I : I: I: I I I I: |: : I I : I 

Db 384 VSGSQGPSEEKPA-PLATEVFDEKIEVHQEEVVAEVHVSTVEERTEEQKTEVEETAGSVP 442 

Qy 151 L AEP -- AAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQP 201 

I I I I I I I | | : | | : I : : : I 

Db 443 AEELVGMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGVVSEVEML 502 

Qy 202 GNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLG NLSAVSSSEGTI 253 

= : I I I : I I I I : I I : : I : 

Db 503 SSQERMKVQGSPLKKLFTSTGLKKLS GKKQKGKRGGGDEESGEHTQVPADSPDSQ 557 

Qy 254 EETLNEASKELPERATN-PFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVI 312 

M hi M : : | | | : : I I : : I I I : | : I 

Db 558 EEQKGESSASSPEEPEEITCLEKGIAEVQQDGEAEEGATSDGEKKREGVTPWASFKKMVT 617 

Qy 313 VRSK DKE DLVCSAALHS PQES PVGKEDRVVS PEKTMDI FNEMQMSVVAPV 362 

: : III I I I I I I : : : : I I : I I I 

Db 618 PKKRVRRPSESDKEDELDKVKSATLSSTEST AS EMQEEMKGS VEE PK 664 



Qy 363 REEYADFKPFEQAWEV KDTYEGS 385 

II :|| Ml: 

Db 665 PEEPKRKVDTSVSWEALICVGSSKKRARRRSSSDEEGGPKAMGGDHQKADEAGKDKETGT 724 

Qy 386 RDVLA ARANVESK VDRKCLEDSLEQKSLGKDS 417 

: 1 I I : I : I I : I I : II : M 

Db 725 DGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKS--EDS 782 

Qy 418 -EGRNEDASFPSTPEPVKDSS RAYITCAS FT SATE--ST 453 

I : I I I II I: I :| : I I I I I 

Db 783 IAGSGVEHSTPDT-EPGKEESWVS I KKFI PGRRKKRPDGKQEQAPVEDAGPTGANEDDSD 841 

Qy 454 TANTFPLLEDHTSENKTDE KKIEERKAQI ITEKTSPKTSNPFLVAVQD 501 

Ml I : I :: | :: | :::: | : | | | 

Db 842 VPAVVPLSEYDAVEREKMEAQQAQKGAEQPEQKAATEVSKELSESQVHMMAAAVADGTRA 901 

Qy 502 SEADYVTTDTLSK VTEAAVSNMPEG- 526 

: I I : I : I : III : I I 

Db 902 ATIIEERSPSWISASVTEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEP LPENR 957 

Qy 527 LTPDLVQEA-CESELNEATGTKI-AYETKVDLVQTSEAIQESLYPTAQ 572 

llhll I I I : I I : : I : : | | : 

Db 958 EARGDTVVSEAELTPEAVTAAETAGPLGSEEGTEASAAEETTEMVSAVSQLTDSPDTTEE 1017 

Qy 573 L CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVS 623 

I M I II : : I I I I I : I I 

Db 1018 ATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTG GPEDVLQPVQ 1069 

Qy 624 YDSIKLEPENPPPYEEA MNVALK ALGTKEGIKEPESFNAAVQE 666 

: I I I II : I I | | | | | | | | | 

Db 1070 RAEAERPEEQAEASGLKKETDVVLKVDAQEAKTEPFTQGKVVGQTTPESFEKAPQV 1125 

Qy 667 TEAPYISIACDLIKETKLSTEPSPD — FSNYSEIAKFEKSVPEHAELVEDSSPES — EPV 722 

M: I : : : I I : I : | : : : | | : | | 

Db 1126 TES IESSELVTTCQAETLAGVKSQEMVMEQAIP PDSVETPT 1166 

Qy 723 DLFSDDSIP EVPQTQEEAVMLMKESLTEVSETVAQH KEERLSASPQELGKPY 774 

I = I I I : I I : I : : I : I I : I I : : I 

Db 1167 DSETDGSTPVADFDAPGTTQ KDEIVEIHEENEVHLVPVRGTEAEAVPAQKERPP 1220 

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

I II: : I : : I : : : || : : | : : : : 

Db 1221 APSSFVFQEETKE QSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYA 1269 

Qy 835 DSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885 

I :: N I I III: I : : 

Db 1270 DEKTKDVPFFEGLEGSIDTGITVSREKVTEVALKGEGTEEAECKKDDALELQSHAKSPPS 1329 

Qy 886 LELPCDLSF KNIYPKDEVHVSDEFSENRSS VSKASISPSNVSALEPQTE 934 

I : : : I I : : | | : : : | | | : | | : : | 

Db 1330 PVEREMVVQVEREKTEAEPTHVNEEKLEHETAVTVSEEVSKQLLQTVNVPIIDGAKE 1386 

Qy 935 MGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVF-- 992 

: I : I I : : : : : I I : | : | | : | : | | 

Db 1387 VSSLEGSPPPCLGQEEAVCTKIQVQSSEASFTLTAAAEEEKV--LGETANILETGETLEP 1444 

Qy 993 -GASLFL 998 



Db 1445 AGAHLVL 1451 



RESULT 14 

US-09-134-001C-4463 

; Sequence 4463, Application US/09134001C 
; Patent No. 6380370 
; GENERAL INFORMATION: 
; APPLICANT: Lynn Doucette-Stamm et al 
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO STAPHYLOCOCCUS 
; TITLE OF INVENTION: EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC-007 
; CURRENT APPLICATION NUMBER: US/09/134,001C 
; CURRENT FILING DATE: 1998-08-13 
; PRIOR APPLICATION NUMBER: US 60/064,964 
; PRIOR FILING DATE: 1997-11-08 
; PRIOR APPLICATION NUMBER: US 60/055,779 
; PRIOR FILING DATE: 1997-08-14 
; NUMBER OF SEQ ID NOS: 5674 
; SEQ ID NO 4463 
; LENGTH: 2137 
; TYPE: PRT 
; ORGANISM: Staphylococcus epidermidis 
US-09-134-001C-4463 



Query Match 4.5%; Score 265; DB 4; Length 2137; 

Best Local Similarity 19.6%; Pred. No. 4.8e-08; 

Matches 172; Conservative 141; Mismatches 451; Indels 114; Gaps 24; 



Qy 159 STPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLE 218 
II- Ml : 1 : 1:1 : : | : | : | 
Db 990 STSTSTSDSASTSTSE SESDSASTSLSESTSTSVSDSTSTSTSDSASMSASESE 1043 

Qy 219 TAASLPSLS PLSTVS FKEHGYLGNLSAV SSSEGTIEETLNEASKELPERATNPFVNR 275 
: : 1 1 1 :: 1 1 : 1 1 hill : | | | : 
Db 1044 SNSKSTSLSESTSTSLS GSTSASTSDSASTSTSESESDSTSTSLSESTSTSLSGS 1098 

Qy 276 DLAEFSELEYSEMGSSFKGSPKGESAI LVE NTKEEVIVRSKDKEDLVC 323 
1 1 : 1 1 : : 1 1 1 1 : I | : : | 
Db 1099 TSASTSD SASTSTSESDSTSESTSLSESLSTSVSDSTSASTSESASTSTSESESNSA 1155 

Qy 324 SAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYE 383 
1=1 : : : : :| : : | : : |: 
Db 1156 STSLSGSLSTSISDSTSTSTSDSASTSTSESESDSTSTSLSE STSTSLSDSTS 1208 

Qy 384 GSRDVLAARANVESKVDRKCLEDSLEQK SLGKDSEGRNEDASFPSTPEPVKDSSRAY 440 
1 1 : : M | ||: | : : 1 :: | | : | : 
Db 1209 TSTSESASTSTSES--DSTSESTSLSESTSTSVSDSTSASTSDSASTSTSVSDSESASTS 1266 

Qy 441 ITCAS FTSATESTTANTFPLLEDHTSE--NKTDEKKIEERKAQIITEKTSPKTSNPFLVA 498 
1= = II -M: :| III : :: : 1 : ::: || ||: 
Db 1267 ISESLSTSVSDSTSTSTSDSASTSTSESDSTSESTSLSESISTSVSDSTSASTSDSASTS 1326 

Qy 499 VQDSEADYVTTD TLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYE 551 
I : I : |: :| | :: 
Db 1327 [sequence illegible in source] 1386 

Qy 552 TKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASV 608 
1 1 1 1 : : : 1 1 : 1 1 I : : | III 
Db 1387 [sequence illegible in source] 1435 

Qy 609 VQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK-ALGTKEGIKEPESFNAAVQET 667 
III II :: : : : 1 : I | : :: |: 
Db 1436 [sequence illegible in source] 1486 

Qy 668 EAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSD 727 
= =1 M 1 1 : :: : : :| I |: :|| : 
Db 1487 [sequence illegible in source] 1536 

Qy 728 DSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQE LGKPYLESFQ 779 
M 1 : : : 1 : 1 I I : : : : III I | | 
Db 1537 [sequence illegible in source] 1596 

Qy 780 PNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 
: : 1 1 : : 1 1 : 1 : 1 : : : : : | | | : | 
Db 1597 [sequence illegible in source] 1656 

Qy 835 DSSPIEIID EFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADS 882 
11= : 1 INI:: : :|| : : :||: | 
Db 1657 [start of sequence illegible in source] STSVSASNSTSTSLSDSRSTSLSDSTSTSTSESGSTS 1716 

Qy 883 LPCLELPCDLSFKNIYPKDEVHVSDEFSENR--SSVSKASISPSNVSALEPQTEMGSIVK 940 
1 1 : : : : 1 1 1 : I : : | : | | | : : : | 
Db 1717 TS--ESDSDSASTSLSESTSTSISDSTSTSTSDSASTSMSVSDSNRASTSLSDSTSTSVS 1774 

Qy 941 SKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVD 978 
: : 1 1 M 1 1 111:1111 
Db 1775 DSTSASTSESASTSTRESESTSASTSLS-ESTSTSVSD 1811 


RESULT 15 

US-09-621-976-4600 

; Sequence 4600, Application US/09621976 
; Patent No. 6639063 
; GENERAL INFORMATION: 
; APPLICANT: Dumas Milne Edwards, J.B. 
; APPLICANT: Jobert, S. 
; APPLICANT: Giordano, J.Y. 
; TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
; FILE REFERENCE: GENSET.054PR2 
; CURRENT APPLICATION NUMBER: US/09/621,976 
; CURRENT FILING DATE: 2000-07-21 
; NUMBER OF SEQ ID NOS: 19335 
; SOFTWARE: Patent.pm 
; SEQ ID NO 4600 
; LENGTH: 75 
; TYPE: PRT 
; ORGANISM: Homo sapiens 
; FEATURE: 
; NAME/KEY: UNSURE 
; LOCATION: 58 
; OTHER INFORMATION: Xaa = His,Pro 
; NAME/KEY: UNSURE 
; LOCATION: 28 
; OTHER INFORMATION: Xaa = Met,Val 
; NAME/KEY: UNSURE 
; LOCATION: 19 
; OTHER INFORMATION: Xaa = Pro,Gln 
; NAME/KEY: UNSURE 
; LOCATION: 53 
; OTHER INFORMATION: Xaa = Ser,Tyr 
US-09-621-976-4600 

Query Match 4.4%; Score 256.5; DB 4; Length 75; 

Best Local Similarity 67.0%; Pred. No. 1.2e-09; 

Matches 59; Conservative 3; Mismatches 11; Indels 15; Gaps : 

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              |||:||| ||||| |||||  |||||||  ||||||            ||||||||||||
Db          1 MEDLDQSPLVSSS-DSPPRXQPAFKYQFXREPEDEE------------EDLEELEVLERK 47

Qy         61 PAAGLSAAAV--PPAAAAPLLDFSSDSV 86
              ||||| || |   ||| |||:|| :| |
Db         48 PAAGLXAAPVXTAPAAGAPLMDFGNDFV 75
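
The four UNSURE locations listed for this entry (58, 28, 19, 53) can be cross-checked against the Xaa (X) residues in the Db rows of the alignment. The Python sketch below is only an illustration: the function name is hypothetical, and the first Db row is written with a run of 12 gap characters where the transcript shows a blank, an assumption inferred from that row ending at residue 47.

```python
def unsure_positions(aligned_row, start):
    """Return 1-based residue numbers of 'X' in one gapped alignment row.

    aligned_row: the Db row as printed, with '-' for gap columns.
    start: residue number of the first residue in the row.
    """
    positions = []
    residue = start - 1
    for ch in aligned_row:
        if ch == "-":          # gap column: no residue is consumed
            continue
        residue += 1
        if ch == "X":          # Xaa, an unresolved residue
            positions.append(residue)
    return positions


# Db rows of RESULT 15 (gap run in row 1 is assumed, see the note above).
row1 = "MEDLDQSPLVSSS-DSPPRXQPAFKYQFXREPEDEE------------EDLEELEVLERK"
row2 = "PAAGLXAAPVXTAPAAGAPLMDFGNDFV"

found = unsure_positions(row1, 1) + unsure_positions(row2, 48)
print(sorted(found))
```

The positions found this way agree with the LOCATION fields of the record, which is a quick consistency check on a transcribed alignment.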



Search completed: September 29, 2004, 18:20:58 
Job time : 37.8717 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: September 29, 2004, 18:05:43 ; Search time 28.465 Seconds 

(without alignments) 
3930.111 Million cell updates/sec 

Title: US-09-830-972-2

Perfect score: 5848 

Sequence: 1 MEDIDQSSLVSSSTDSPPRP......VKDAMAKIQAKIPGLKRKAD 1163



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 



Total number of hits satisfying chosen parameters: 283366 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIR_78:* 
1: pir1:*
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
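
The run header names a Smith-Waterman ("sw") model with an opening and an extension gap penalty. As a rough illustration of that kind of scoring, the Python sketch below computes a local alignment score with affine gaps. It is not GenCore's implementation: the function names are hypothetical, a toy match/mismatch score stands in for BLOSUM62, and the convention that a gap of length k costs Gapop + k * Gapext is an assumption.

```python
def local_align_score(a, b, score, gap_open=10.0, gap_ext=0.5):
    """Best local alignment score with affine gaps (Gotoh-style recurrences).

    Assumed gap cost: a gap of length k costs gap_open + k * gap_ext.
    """
    neg = float("-inf")
    m = len(b)
    h_prev = [0.0] * (m + 1)   # best score of an alignment ending at (i-1, j)
    f_prev = [neg] * (m + 1)   # same, but ending in a gap that skips a[i-1]
    best = 0.0
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (m + 1)
        f_cur = [neg] * (m + 1)
        e = neg                # ending in a gap that skips b[j-1]
        for j in range(1, m + 1):
            e = max(e - gap_ext, h_cur[j - 1] - gap_open - gap_ext)
            f_cur[j] = max(f_prev[j] - gap_ext, h_prev[j] - gap_open - gap_ext)
            diag = h_prev[j - 1] + score(a[i - 1], b[j - 1])
            h_cur[j] = max(0.0, diag, e, f_cur[j])   # 0.0 restarts locally
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def toy_score(x, y):
    """Toy substitution score (NOT BLOSUM62): +4 identical, -4 otherwise."""
    return 4.0 if x == y else -4.0


# Ten identical residues score 10 * 4.
print(local_align_score("MEDLDQSPLV", "MEDLDQSPLV", toy_score))
# Twenty matches bridged by one gap of length 2: 80 - (10 + 2 * 0.5).
print(local_align_score("ACDEFGHIKLGGMNPQRSTVWY", "ACDEFGHIKLMNPQRSTVWY", toy_score))
```

With the parameters from the header, a single long gap is much cheaper than several short ones, which is consistent with the long indel runs visible in the lower-scoring alignments.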



SUMMARIES 

                   %
Result           Query
   No.    Score  Match  Length  DB  ID        Description

     1    789.5   13.5     776   2  A46583    neuroendocrine-spe
     2      685   11.7     208   2  I60904    neuroendocrine-spe
     3      671   11.5     267   2  A60021    tropomyosin-relate
     4      517    8.8    2484   2  T26216    hypothetical prote
     5    503.5    8.6    2607   2  T26215    hypothetical prote
     6    328.5    5.6    5327   2  T13564    microtubule-associ
     7      322    5.5    7962   2  I38346    elastic titin - hu
     8      320    5.5     222   2  T26213    hypothetical prote
     9    304.5    5.2     873   2  A47283    calphotin - fruit
    10    302.5    5.2    1829   2  T24583    hypothetical prote
    11    295.5    5.1     865   2  A47282    calcium-binding pr
    12      292    5.0    3507   2  T34513    hypothetical prote
    13    291.5    5.0    2364   2  A56577    microtubule-associ
    14    288.5    4.9     971   2  T19431    hypothetical prote
    15      284    4.9    2464   1  QRMSP1    microtubule-associ
    16      281    4.8    3488   2  T34418    hypothetical prote
    17    279.5    4.8    1262   2  T22523    hypothetical prote
    18      277    4.7    1621   2  A82255    hypothetical prote
    19    275.5    4.7    3924   2  S37431    ankyrin 2, neurona
    20      275    4.7    1299   2  T47182    hypothetical prote
    21    273.5    4.7    1029   2  T30351    mucin-like protein
    22    273.5    4.7    1274   2  T16251    hypothetical prote
    23      273    4.7    1558   2  B71603    RESA-H3 antigen PF
    24    272.5    4.7    3534   2  T42567    tegument protein 2
    25      270    4.6    1230   2  T22458    hypothetical prote
    26    269.5    4.6    2187   2  T30826    nascent polypeptid
    27      267    4.6    1684   2  JW0057    gravin - human
    28    265.5    4.5    1828   2  A40115    microtubule-associ
    29      263    4.5    1825   2  S13507    microtubule-associ
    30    261.5    4.5    1087   1  QFMSH     neurofilament trip
    31      261    4.5     606   2  A43427    neurofilament trip
    32      261    4.5    2570   2  T17451    fimbriae-associate
    33      260    4.4    1804   2  T34518    nestin - golden ha
    34    259.5    4.4    1020   1  QFHUH     neurofilament trip
    35      259    4.4    1510   2  T33100    hypothetical prote
    36    258.5    4.4    1830   2  A37981    microtubule-associ
    37      257    4.4    1224   2  T14007    microtubule-associ
    38      256    4.4    2361   2  T25752    hypothetical prote
    39    254.5    4.4    6642   2  T29757    protein UNC-89 - C
    40      254    4.3    1616   2  G64242    cytadherence-acces
    41      254    4.3    3381   2  T42389    versican precursor
    42      253    4.3    3421   1  WZBEB6    367K tegument prot
    43    252.5    4.3    5170   2  T15348    hypothetical prote
    44      251    4.3    1824   1  QRHUMT    microtubule-associ
    45    250.5    4.3    4377   2  A55575    ankyrin 3, long sp
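
The "% Query Match" column in the summaries appears to be each hit's score expressed as a percentage of the perfect score (5848) given in the run header. The report does not state this formula, so treat it as an assumption; it does reproduce the listed values. A minimal Python sketch with hypothetical names:

```python
PERFECT_SCORE = 5848  # "Perfect score" from the run header


def query_match_percent(score, perfect=PERFECT_SCORE):
    """Score as a percentage of the perfect (self-match) score, one decimal."""
    return round(100.0 * score / perfect, 1)


# Scores of the first four summary rows.
top_hits = {"A46583": 789.5, "I60904": 685, "A60021": 671, "T26216": 517}
for pir_id, score in top_hits.items():
    print(pir_id, score, query_match_percent(score))
```

For the four rows used here the computed values agree with the table: 13.5, 11.7, 11.5 and 8.8.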



ALIGNMENTS 



RESULT 1 
A46583 

neuroendocrine-specific protein, splice form A - human 

N;Contains: neuroendocrine-specific protein, splice form B
C;Species: Homo sapiens (man)
C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999
C;Accession: A46583; I60903
R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers,
F.C.; Van de Ven, W.J.
J. Biol. Chem. 268, 13439-13447, 1993
A;Title: Cloning and expression of alternative transcripts of a novel
neuroendocrine-specific gene and identification of its 135-kDa translational
product.
A;Reference number: A46583; MUID:93293865; PMID:7685762
A;Accession: A46583
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-776 <ROE1>
A;Cross-references: GB:L10333; NID:g307306; PIDN:AAA59950.1; PID:g307307
A;Accession: I60903
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 421-776 <ROE2>
A;Cross-references: GB:L10334; NID:g307308; PIDN:AAA59951.1; PID:g307309
C;Genetics:
A;Gene: GDB:RTN1; NSP
A;Cross-references: GDB:203968; OMIM:600865
A;Map position: 14q21-14q22

Query Match 13.5%; Score 789.5; DB 2; Length 776; 

Best Local Similarity 31.2%; Pred. No. 3.5e-25; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 
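
The "Best Local Similarity" figure appears to be the number of identical columns divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). The report does not spell this out, so the formula is an assumption, but it reproduces the percentages printed for the hits in this listing. A minimal Python sketch with a hypothetical function name:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts taken from the statistics lines of RESULT 1, 2 and 3.
print(best_local_similarity(243, 115, 252, 169))  # RESULT 1
print(best_local_similarity(128, 32, 31, 0))      # RESULT 2
print(best_local_similarity(124, 33, 30, 0))      # RESULT 3
```

The same counts are useful for checking a transcribed alignment: the four numbers must add up to the number of columns actually printed between the first and last Qy/Db positions.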

QY 4 87 TSPKTSNPFLVAVQDSEADYWTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 54 6 

: I : I I I : : : : | : | | | : | : | . 
D b 65 SGPARQSP--VAMETASTGVAGVS SAMDHTFSTTSKDGEG SCYTSLI S 110 

QY 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI--VMEAPLNS 599 

I I : I : : | | : : | | | I | | | | | : : | | 

Db HI DICYPPQEDSTYFTGILQKENGHVTISESP— EELGTPGPSLPDVPGIESRGLFSSDSG 167 

QY 600 — LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

: I: I : II: :| : :| : :: | |: 

Db 168 IEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGIKEPE SFNAAVQETEAPYISIACDLIKETKLSTE-PSP 690 

: : : : I I I : : I I : : : I I I I I I I : I : : : : I 

Db 228 DISIK PEGVREPDKPAPVEGKIIKDHLLEESTFAPYID DLSEEQRRAPQITTP 280 

QY 691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSIPEV PQTQEEAVMLMK 745 

: I I I : : : | | : : | | | : : | | | : s 

Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

QY 746 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792 

I II I :l I :l ::| : I I : | I : : 

Dk> 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGPPT 394 

Qy 793 IPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI 827 

II: II:: : I I : : | | : 

Db 395 IPSPLDHEASSAESGDSEIELVSEDPM — AAEDALPSGYVSFGHVGGPPPSPASPSIQYS 452 

QY 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 879 

:| I M hi I || ::||| :: :: | | : 

Db 453 ILREEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAIREETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVS KASI SPSNVSALEPQTEMGSIV 939 

: M I I I 1111:11 

Db 510 APSRRGLAEPG — SFLD-YPSTEPQPGPEL PPGDGALEPETPM 549 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984 

N I : : I : : : | : | : | I | I I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 IKKTGWFGASLFLLLSLTVFSIVS VTAYIALALLS VTISFRIYKGVIQAIQKSDEGHPF 1044 

M : I I : I I I : I || Ml 11 = 11! I I : I I I || | | | | | | M I : M : I I : I I I I I I 
Db 598 IKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPF 657 



Qy 1045 RAYLESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVXMWVFTYVGA 1104 

= IMI I : : I : I : I I I :: : I I I I : I I I II I I M I I I I I I I I I I I I I : Mill 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYWSTLKELRRLFLVQDLVDSLKFAVIJyiWLLTYVGA 717 

QY LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKD^ 1163 

M I I I I M : : I : : I : I : : I I : I : | | | | | | | | | : : I I I I I I I I I II I : 
Db 718 L FNGLT L LLMAWSMFT L P WYVKHQAQ I DQ YL GLVRTH I NAWAKI QAK I P GAKRHAE 776 

RESULT 2 
I60904

neuroendocrine-specific protein C - human
C;Species: Homo sapiens (man)
C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999
C;Accession: I60904
R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers,
F.C.; Van de Ven, W.J.
J. Biol. Chem. 268, 13439-13447, 1993
A;Title: Cloning and expression of alternative transcripts of a novel
neuroendocrine-specific gene and identification of its 135-kDa translational
product.
A;Reference number: A46583; MUID:93293865; PMID:7685762
A;Accession: I60904
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-208 <RES>
A;Cross-references: GB:L10335; NID:g307310; PIDN:AAA59952.1; PID:g307311
C;Genetics:
A;Gene: GDB:RTN1; NSP
A;Cross-references: GDB:203968; OMIM:600865
A;Map position: 14q21-14q22

Query Match 11.7%; Score 685; DB 2; Length 208;
Best Local Similarity 67.0%; Pred. No. 1e-21;
Matches 128; Conservative 32; Mismatches 31; Indels 0; Gaps 0;

Qy        973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032
              |:  :|||||||||:||:|||: | || ||| ||:||| ||:||| || |||||||| |:
Db         18 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVL 77

Qy       1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092
              ||:||:||||||:|||| |: :|:| :|||::    :||||:||||||||| ||||||||
Db         78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137

Qy       1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152
              |||||: |||||||||||||::|::|:|::||:| :|| ||| ||||    :   :||||
Db        138 AVLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQ 197

Qy       1153 AKIPGLKRKAD 1163
              ||||| || |:
Db        198 AKIPGAKRHAE 208



RESULT 3 
A60021 

tropomyosin-related protein, neuronal - rat 
C;Species: Rattus norvegicus (Norway rat)
C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999
C;Accession: A60021
R;Wieczorek, D.F.; Hughes, S.R.
Brain Res. Mol. Brain Res. 10, 33-41, 1991
A;Title: Developmentally regulated cDNA expressed exclusively in neural tissue.
A;Reference number: A60021; MUID:91278684; PMID:1647480
A;Accession: A60021
A;Molecule type: mRNA
A;Residues: 1-267 <WIE>
A;Cross-references: EMBL:X52817; NID:g456549; PIDN:CAA37001.1; PID:g456550
C;Comment: This neuronal-specific mRNA was identified by hybridization to an
alpha-tropomyosin probe but does not show homology in amino acid sequence.

Query Match 11.5%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 5.6e-21; 

Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0; 

QY 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIAIALLSVTISFR1YKGVI 1032 

I : : I I I I I I I I I : II : I I I : I II I I I I I : : I I I II I I I I I M I I : 

Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLAIAALSATISFRIYKSVX 68 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

M:||:|IMI|:|||| |: :|:| :|||:: : I I I I : M I I I I I I I IMIIIM 

Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1093 AVLMWFTWGT^FNGLTLLILALISLFSIPVIYERHQVQIDHYLGl^KSVKDAMAKIQ 1152 

M I I I : I I I I I I I I I I I I I : : I : : I : | : : M : | : | | | : | | | | | : : | | | | 

Db 12 9 AVLMWLLTWGALFNGLTLLLMAWSMFTLPVVWKHQA 18 8 

Qy 1153 AKIPGLK 1159 

Mill : 
Db 189 AKIPGAR 195 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T26216
R;Ainscough, R.
submitted to the EMBL Data Library, August 1996
A;Reference number: Z20173
A;Accession: T26216
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-2484 <WIL>
A;Cross-references: EMBL:Z78066; PIDN:CAB51467.1; GSPDB:GN00023; CESP:W06A7.3c
A;Experimental source: clone W06A7
C;Genetics:
A;Gene: CESP:W06A7.3c
A;Map position: 5

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 8.8%; Score 517; DB 2; Length 2484; 

Best Local Similarity 20.1%; Pred. No. 2.7e-13; 

Matches 262; Conservative 189; Mismatches 413; Indels 440; Gaps 51; 



QV 28 FVTEPEDEE DEEEEEDEEED DEDLEELEVL 57 

Ml : I I : I | | | | I II : | : | 

Db 1437 FGTESSEESQKADGNQENQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDV 1496 

58 ERKPAAGLSAAAVP PAAAAPL-LDFSSDSVPPAPRGPLPAAPPAAP — ERQPSW 108 

I: : i I I: : I Ill::| I |:| || | | 

Db 14 97 DASDVNEQDEESTLKILKWPSEPSLLELDFTND — PKVI HVPIPLMEPATMYLEEMVEW 1554 

Qy 109 E R S PAAPAP S L P PAAAVL P S KLP-EDDEPPARPPPPPPA 14 6 

I : I I : : : I I I I I I II III 

Db 155 5 HADAVKEVSEMEWTESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPG 1614 

Qy 147 GASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPS-SAEKIMDLMEQPGNTV 205 

I I 1 = 1 I I I :: | : | | : I : I I 

Db 1615 QVQ ERI I PIEVEQAPTI PQRPPRAPKSE LPKVAKPLDDSKSRVRFAPLNIKLGRTY 1670 

Qy 206 S S GQEDFPSVLLETAAS LPSLS PLSTVS FKEHGYLGNLSAVS S SEGT I EETLNEAS KELP 265 

lh I II II: :: :: | 

Db 1671 SEEQQK ELVES LE--RPLTI IT QQKPP 1695 

Qy 2 66 ERAT NPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKE 319 

hi : I I I I : I : : I I 
Db 1696 EKPTEDIGALSPLSPNTLAEYEEVPMMDMQS 1726 

Qy 320 DLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQ-MSWAPVREEYT^DFKPFEQAWEV 378 

I I I I I I : : : I : I I I I I I I I I I 

Db 1727 VPHSPQE KQEEI EALSEIIEEPQAMKEVE KPVESAPE- 1763 



Qy 379 KDTYEGSRDVLAARANVESKVDRKCLEDSL — EQKSLGKDSEGRNEDASFPSTPEPVKDS 436 
II : I I : : | : | : | | | : | | : | 
1764 KD NESLEAPEI INEPI RRVLVETKIMGPGKSLNED NDD 1801 



Db 



^ 43 7 SRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFL 496 

= 1 M : II II 
Db 1802 DDDGSECLDSIGDLSER TIQRFN 1824 



Qy 497 VAVQDSEADYVTTDTLSKVT EAAVSNMPEGLTPDLVQEACESELNEATGTKI 54 8 

: : I : |: I :: 
1825 TSIDDPS IRRDSFSSISSFGDRQKFRTAIENIRQDLLP 1862 



Db 



Qy 549 AYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASV 608 

: : : I I: :: | |: || : : : : | : : | : | | : | 
Db 1863 -FQSSV SQYLRSSPNPSQQL LVTNLSMDSP-SDLSPNA 1898 

Qy 609 VQPSVSPLEAPPPVSYDSI KLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAA 663 

Ml::: M : I : I I : : : | : : : : 

Db 1899 PPVGFENTAQFLEKLQQEDRPSAEGSID SSGFEKVD--HEG 1937 

Qy 664 VQETEAPYISIACDLIKETKLSTEPSPDF SNYSEIAKFEKSVPE 707 

: I II : | :::: : | | ., . . , : | 

Db 1 9 38 LDEFAAPPVH— DPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATLKKNQKMSSH 1994 

Qy 708 HAELVEDS SPESEPVDLFSDDSIPEVPQTQEEAV MLMKESLTE 750 

I ::M : II :| |:: : :::|| : :| | 

Db 1995 HNDVI EKNYFNDNAPTAALLE S P I AEEARKLVQDAVES AS EYKKQAVDS GDE I GRELLDN 2054 



Qy 751 VSETVAQHKE ERLSASPQELGKPYLESFQP 780 

I : : I II | |: :|: :| 

Db 2055 VEQKIEQVKEPIVDSLHKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPVPEKIETPEP 2114 

QY 781 --NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDK IKES 830 

=:l II::: ::| I : ||: :|| :: 

Db 2115 LVDIHDTVDKVHDEVDNFLRREP TPPFETDDVAPLSDDKPQFGNQTPEEDE 2165 

QY 831 ETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGA DSLPCL 886 

II I: I :| :|::: :: I |: II :|| 

Db 2166 TTFDRKGPLTIPEEVEKAAAAQNND LDDFDPLVTSNTGAAFGAAVGAAAAVESLTEE 2222 

QY B87 ELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKASI SPSNVSA LEPQTEM 935 

I : I : : III : I I I : I I I : I : 

Db 2223 EMFGHQKFETVPRPPTPPKD ISDE DVKPSTVNLGPSHHHSHPSSPH 2268 

Qy 936 GSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGAS 995 

1 I : I I : : I : I : : I I I I I I : : I : 

Db 2269 HSILKHHG DAW I DFKTVP P CVLDVI YWRDAKKSAI VLS LA 2308 

QY 996 LFLLLSLTVFS IVSVTAYIALALLSVTI S FRI YKGVIQAIQKSDEGHPFRAYLESEVAI S 1055 

IH I : : : : I I I I I I : : I I I : I : I III I : : : 

Db 2309 LLVLFVLAKYPLLTVVTYSLLLALGAAAGFRVFKKVEAQIKKTDSEHPFSEILAQDLTLP 2368 

Qy 1056 EELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 1115 

: I I : : : I : I : = I I : : : I : I I : : : I I I : : I : I I I I I 

Db 2369 QEKVHAQADVFVEHATCIANKLKKLVFVES PLES I KFGLVLWSLT YI ASWFSGFTLAI LG 2428 

Qy 1116 LI SLFSI PVI YERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLK 1159 

I : : I I : I : I I : I I I : I : : I : | I : I , : 

Db 2429 LLGVFSVPKVYESNQEAI DPHLATI SGHLKNVQNI IDEKLPFLR 2472 



RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T26215
R;Ainscough, R.
submitted to the EMBL Data Library, August 1996
A;Reference number: Z20173
A;Accession: T26215
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-2607 <WIL>
A;Cross-references: EMBL:Z78066; PIDN:CAB01522.2; GSPDB:GN00023; CESP:W06A7.3a
A;Experimental source: clone W06A7
C;Genetics:
A;Gene: CESP:W06A7.3a
A;Map position: 5
A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2;
2586/2

Query Match 8.6%; Score 503.5; DB 2; Length 2607;
Best Local Similarity 20.8%; Pred. No. 1e-12;
Matches 280; Conservative 181; Mismatches 481; Indels 407; Gaps 52;



Qy 2 8 FVTEPEDEE DEEEEEDEEED DEDLEELEVL 57 

I I I : I I : I I I I I | | | : | : | 

Db 1437 FGTESSEESQKADGNQENQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDV 1496 

Qy 58 ERKP AAGL S AAAVP P AAAAP L-LDFSSD S VP PAP RG PL P AAP P AAP — ERQPSW 108 

I : : I I I : : I I I I :: I I hi II | I 

Db 1497 DASDVNEQDEESTLKILKWPSEPSLLELDFTND — PKVIHVPI PLMEPATMYLEEMVEW 1554 

Qy 109 ERSPAAPAPSLPPAAAVLPS KLP-EDDEPPARPPPPPPA 146 

I : M : : : I I I I I I I I III 

Db 1555 IIADAVKEVSEMEWTESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPG 1614 

Qy 147 GASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPS-SAEKIMDLMEQPGNTV 205 

I I I : I I I I : : | : | | : I : | I 

Db 1615 QVQERIIPIEVEQAPTIPQRPPRAPKSE LPKVAKPLDDSKSRVRFAPLNIKLGRTY 1670 

Qy 206 SSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELP 265 

II: I M II::: :: I 

Db 1671 SEEQQK ELVESLE— RPLTIIT QQKPP 1695 

Qy 266 ERAT NP FVNRDLAEFS ELEYS EMGS S FKGS P KGESAI LVENT KEEVI VRS KDKE 319 

I : I : I I I I : I : : I I II : : | | : I : I 

Db 1696 EKPTEDIGALSPLSPNTLAEYEEVPMMDM-QSVPHSP QEKQEEIEALSEI IE 1746 

Qy 320 DLVCSAALHSPQES-PVGKEDRWSPEKTMDIFNEMQMSWAPVREEYMFK PFEQA 375 

: : I I I I : : : I I III I : I : I I : 

Db 1747 EPQAMKEVEKPVESAPEKDNESLEAPE IINE PIRRVLVETKIMGPGKSL 17 95 

Qy 37 6 WEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDS E 418 

II: : I : : : : : : | : : | : : | | | 

Db 1796 NEDNDDDDDGSECLDSIGDLSERTIQR-FNTSIDDPSIRRDSFSSISSFGDRQKFRTAIE 1854 

Qy 419 GRNED ASFPSTPEPVKDSSRAYITCAS FTSATESTT ANTFPLLE 462 

: I 1:11 I : : I I I : : : | | | | 

Db 1855 NIRQDLLPFQSSVSQYLRSSPNP SQQLLVTNLSMDS PS DLS PNAP PVGFENTAQFLE 1911 

Qy 463 DHTSENKTDEKKIEERKAQII TEKTSPKTSNPF 495 

I : I I : :: | | : :: | : : 

Db 1912 KLQQEDRPSAEGSIDSSGFEKVDHEGLDEFAAPPVHDPMQKSVFGSLGSDDMKPGSQDDG 1971 

Qy 496 LVAVQ D S EAD YVT TDTL5K VTEAAVSNMPEGLTPDLVQ 533 

I :: :||: I I : I : |: :: I I |: 

Db 1972 FVFIERNEANEATLKKNQKMSSHHNDVIEKNYFNDNAPTAALLESPIAEEARKLVQDAVE 2031 

Qy 534 EACE SELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVL 588 

II : : | :: : : | | | : | | : | I I : ; 

Db 2032 SASEYKKQAVDSGDEIGRELLDNVEQKIEQVKEPIVDSLHKAYDGVGDFVH-ETVPNAV- 2089 

Qy 589 PDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKAL 648 

I I M | I: | 
Db 2090 DDFVREAE KQLPESPVP 2106 

Qy 649 GTKEGI KEPES FNAAVQETEAP YI S I ACDLI K ETKLSTEPSPDFSNYSEIAKFEK 703 

11:11 1:1 : I : I ||:| I 
Db 2107 EKIETPE PLVDIHDTVDKVHDEVDNFLRREPTPPFE 2142 



Qy 704 SVPEHAELVEDSSPESEPVI)LFSDDSIPEVPQTQE^VMIiMKESLT EVSETVAQHKE 760 

Db 2143 TDDVAPLSDDKPQFGNQT-PE- EDETTFDRKGPLTI PEEVE KAAAAQNN 218 9 

Qy 761 ERLSASPQELGKPYLESFQPNLHSTKDA- ASNDIPTLTKKEKISLQMEEFNTAI 813 

Db 2igo o LDDFDPLVTSNTGAAFGAAVGAAAA.VESLTEEEMFGHQ--KFETVP 2234 

Qy 814 YSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSP KLAKEYTDLEVSDKS 871 



2235 RPPTPPKDISDEDVKPS— TVNLGPSHHHSHPSSPHHSILKHHGDAWIDFKT 2284 

Qy 872 EIANIQSG ADSLPCLE----LPCDLSFKNIYPKDEVH 904 

2285 VPPCAQNAFSPGEIMFLLAFFVYLSCFASFFSKSLPLLDNLLSLWYLSISLI IH 2339 

905 VSDE FSENRSSVSKASISPSNVSA- LEPQTEMGSIVKSKSLTKEAEK 950 



Db 



Db 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



' UJ v °"~ 7 . . . | | . | | : : I II I 

2340 VKHHRKFRWNEEQATTMSKLGAVGRGLYALIAFIVNIVLRVGLNVALWGVAVSAHEAYK 2399 

951 KLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSV 1010 

I I II | : I : : I I I I I I : : I : ' : 1 1 : : : * ' „ „ , 

2400 LTK g SGVLRKKEVLDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTV 2446 

1011 TAYIAI^LSVTISFRIYKGVIQAIQKSDEGHPFPAYLESEVAISEELVQKYSNSALGHV 1070 

2447 VTYSLLLALGAAAGFRVFKKVEAQIKKTDSEHPF^ 2506 
1071 NSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI5LFSIPVIYERHQ 1130 

- I • • I I • : : I : I I : : : I I I = = I : I II I 1 1 : : 11 : 1 : 11 " 1 nJzrr 
2507 TCIANKLKKLVFVESPLESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQ 2566 



1131 VQIDHYLGLANKSVKDAMAKIQAKIPGLK 1159 

|| : | : : I : I I = I I : 
2567 EAIDPHLATISGHLKNVQNIIDEKLPFLR 2595 



RESULT 6
T13564

microtubule-associated protein homolog - fruit fly (Drosophila melanogaster)
N;Alternate names: hypothetical protein EG:49E4.1
C;Species: Drosophila melanogaster
C;Date: 13-Aug-1999 #sequence_revision 13-Aug-1999 #text_change 17-Nov-2000
C;Accession: T13564
R;Spanos, L.; Papagiannakis, G.; Siden-Kiamos, I.; Louis, C.
A;Reference number: Z17689
A;Accession: T13564
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-5327 <SPA>
A;Cross-references: EMBL:AL031128; PIDN:CAA20006.1
C; Genetics: 

^^^2,?^Xn^n, 2S«/1, «"./3, 52"/l 

A;Note: EG:49E4.1 



C; Superfamily: Drosophila 576K microtubule-associated protein homolog 



Query Match 5.6%; Score 328.5; DB 2; Length 5327; 

Best Local Similarity 22.7%; Pred. No. 4.1e-05; 

Matches 255; Conservative 176; Mismatches 453; Indels 238; Gaps 54; 

Oy 2 EDIDQSSLVSSSTDSPPRPPPAFKYQFVTEP EDEEDEEEEEDEEEDDEDLEELEVL 57 

:: |:| I : I I I : I I :ll I: II 1=1 I 

Db 3196 DEADKSKEESRRESGAEKSPLASKE — ASRPASVAESIKDEAEKSKEESRRESVAEKSPL 3253 

Qy 58 ERKPAAGLSAAA — VP PAAAAPLLDFS SDSVPPAPRGPLPAAP PAAP 102 

||: : : | I I : I I I I I : M : : I 

Db 3254 PSKEASRPTSVAKSVKDEAEKSKEESSRDSV — AEKSPLASKEASRPASVAESVQDEAEK 3311 

q v 103 ERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLA-EPAA 156 

|: |:|| | : I 1:1 I I I INI : I: 

Db 3312 SKEESRRESVAEKSPLAYKEASRP-ASVAESIKDEAEKSKEESRRESVAEKSPLASKEAS 3370 

Qy 157 ppsTPA AP KRRG S GS VDET L FAL P AAS - E P VI P S S - AE KI MDLMEQ P GNT VS S G 208 

|:: I II: II I I I I |:| I , : I I: 

Db 3371 RPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRE 3430 

Qy 209 QEDFPSVLLETAASLPSLSPLSTVSFKEHG YLGNLSAVSSSEGTIEETLNEAS 261 

|| Ml: I I : I : : I I : : : I : 

Db 3431 SVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESV 3490 

Qv 262 KELPERATNPFVNRDLAEFSELEYSEMG — SSFKGSPKGESAI LVENTKEEVIVRSKDKE 319 

|: |:: :|| I I I :| I I I: I :IM : l 
Db 3491 KDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEA EKSKEE SRRE 3541 

Qy 320 DLVCSAALHSPQES-PVGKEDRW-SPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWE 377 

: : : | | : I I : I I I : : : : : I : : I : I I 

Db 3542 SVAEKSSLASKE1ASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEAS-RPASVAES 3600 

Qy 378 VKDTYEGSRDV LAARANVESKVDRK — CLEDSLEQKSLGKDSEGRNE 422 

Ml I I:: I :| :: : II : : :|:: :: I I I 

Db 3601 VKDEAEKSKEVSRRESVAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPL 3660 

Qy 423 DASFP-STPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTS ENKTD- 471 

:|| I I I III : : I II I : M I I : ' 

Db 3 661 ASMEASRPTSVAESVKDETEK SKEESRRESVTEKS-PLPSKEASRPTSVAESVKDE 3715 

Qy 472 -EKKIEERKAQIITEKT SPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG- 526 

|1 | | : : : I I : I : : I I I I — = I : : ' I I 

Db 3716 AEKSKEESRRESVAEKSPLASKESSRP ASVAESIKDEAEGTKQESRRESMPESG 37 69 

527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYP 569 

I I I I : : I : I : I 



Qy 



Db 3770 KAESIKGDQSSLASKETSRPDSWESVKDETEKPEGSAI DKSQVASRPESVAVSAKD 3826 

Qy 570 TAQLCPS FEEAEAT P S PVLPDIVMEAPLNS LLP SAGAS WQP SVSPLEAP PPVS YDS I KL 629 

| |: | I II II I II :: 11:1 I I 
Db 3827 EKSPLHSRPESVADKS PDASKEA SRSLSVAETASSPIEEGPRSIAD 3872 

0v 630 EPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPS 689 

::: I I :| I I : :: : I :: I I : I 

Db 3873 LSLPLNLTGEAKG-KLP-TLSSPIDVAEGDFL EVKAESSPR 3911 



Qy 690 PDFSNYSEIAKFEKSVPEH — AELVEDSSPESEPVDLF SDDSIPEV 733 

| : |: |:| : I : |:::|| I :: : 

Db 3912 p— AVLSKPAEFSQPDTGHTASTPVDEASPVLEEIEWEQHTTSGVGATGATAETDLLDL 3969 

Qy 734 PQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASNDI 793 

: I : t I |:| | : II: I :: : I I I : — 

Db 3970 TETKSETV — TKQSETTLFETLTSKVESKVEVLESSV-KQVEEKVQTSVKQAETTVTDSL 4026 

Qy 794 PTLTKKEKISLQMEE FNTAIYSNDDLLSSKEDKIKESETFSDSSP 838 

I I I I I I: I |:||: :: : :|: I I 

Db 4027 EQLTKKS--SEQLTEIKSVLDTNISNVTNLFSTAVETIEKKVQDVTEKVIEKAT 4078 

Qy 839 IEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIY 898 

| : | I : : | : : : | : I : : I I : I : I 

Db 4079 -EHVSEHVT — TTGESSTETSQEKSSLDLGTFSELRETHITTVGSPEFTV TIC 4128 

899 PKDE— VH-VSDEFSENR SSVSKAS-ISPSNVSALEPQTE MGSIVKS- 941 

: | | : | : : I I : I Nihil : I I : I : : : I I 

Db 4129 ERDEPVLHDIKEEDEEHRFSPPSDVDKAAIIPPQPMRPLSPREEEVAKIV7VDVAKVLKSD 4188 



Qy 



Qy 942 KSLT KEAEKKLPS— DTEKE-DRSLSAVLSAELS 972 

|:| : : I : I I I I I I : I I : I I hi 

Db 4189 KDITDIIPDFDERQLEEKLKSTADTEEESDKSTRDEKSLEIS 4230 



RESULT 7 
I38346

elastic titin - human (fragment)
C;Species: Homo sapiens (man)
C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000
C;Accession: I38346
R;Labeit, S.; Kolmerer, B.
Science 270, 293-296, 1995
A;Title: Titins: giant proteins in charge of muscle ultrastructure and
elasticity.
A;Reference number: A57430; MUID:96026330; PMID:7569978
A;Accession: I38346
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-7962 <RES>
A;Cross-references: EMBL:X90569; NID:g1017426; PIDN:CAA62189.1; PID:g1017427
C;Genetics:
A;Gene: GDB:TTN
A;Cross-references: GDB:127867; OMIM:188840
A;Map position: 2q31-2q31

Query Match 5.5%; Score 322; DB 2; Length 7962; 

Best Local Similarity 21.7%; Pred. No. 0.00014; 

Matches 259; Conservative 146; Mismatches 416; Indels 370; Gaps 58; 

Qy 23 AFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVL ERKPAAGL 65 

II: : II h Mill h:| : I lh hll 
Db 6574 AFEEEVVTHVEEYLVEEEEEYIHEEEEFITEEEVVPVI PVKVPEVPRKPVPEEKKPVPVP 6633 

Qy 66 SAAAVPPAAA APLLDFSSDSVPPA PRGPLP AAP 98 

Ml | : I :||| h I : I i I 



Db 



6634 KKKEL^PPAKVPEVPKKPEEKVP\^IPKKEKPPPAKVPEVPKKPVPEEKVPVPVPKKVEAP 6693 



Qy 



99 PA-APE— RQPSWERSPAAPAP SLPPA-AAVLPSKL-PEDDEP PAR PPPPP 144 

|| || ::| I: III Ml : : :l I = HI 

Db 6694 P AKVP EVP KKP VP E KKVP VP AP KKVEAP P AKVP EVP KKL I P EE KKP T P VP KKVEAP P P KV 6753 



Qy 145 PAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNT 204 

| |: I I I :l II I : : I l: = = I s 

Db 6754 PKKREPVPVPVALPQE EEVLF EEEIVPE--EEVLPEEEE 6790 

Qy 2 05 VSSGQEDFPSVLLETAASLP— SLSPLSTVSFKEHGYLGNLSAVSSSEGTI-- EETLNE 259 

| : I : : I II I I : M I I 

Db 6791 VLPEEEE VLPEEEEVLPEEEEIPPEEEEVPPEEEY VPEEEEFVPEEEVLPE 6841 

Qy 260 ASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESA 1 LVENT KEEVI V 313 

: : I I | : : : | : II I I : : :|: |: 

Db 6842 VKPKVPVPAPVPEIKKKVTEKKW IPKKEEAPPAKVPEVPKKVEEKRII 6890 



Qy 



314 RSKDKEDLVCSAALHSPQESPVGKED RWSPEKTMDIFNEM 354 

| : : I I I : I I : : I : I : I = : : 

Db 6891 LPKEEEVLPVEVT-EEPEEEPISEEEIPEEPPSIEEVEEVAPPRVPEVIKKAVPEAPTPV 6949 



Qy 



355 QMS VVAPVREE YADFKP FEQAWEVT^DT YEGS RDVLAARANVE S K 398 

: I II::: I I : II : :: : I 

Db 6950 P KKVEAP PAKVSKKIP EE KVP VP VQKKEA PPAKVPEVPKKVPEKKVLVPKKEAVPPA 7006 



Qy 



399 VDRKCLEDS LEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCA 444 

III: | | : : : I I : I I I 
Db 7007 KGRTVLEEKVSVAFRQEVWKERLELEWEAEVEEIPEEEEFHEVEE YFEEG 7 058 



Qy 



445 SFTSATESTTANTFPLLEDHTSEN KTDEKKIEER KAQIITEKTSPKT 491 

| | : I : I I : :| :: I: I |:|| I 

Db 7059 EFHEVEEFIKLEQHRVEEEHRVEKVHRVI EVFEAEEVEVFEKPKAPPKGPEISEKIIPPK 7118 



Qy 



4 92 S N P FLVAVQ D S EAD YVT T DT L S KVT EAAV SNMPEGLTPDLV 532 

||: I III : : I I I I I 

Db 7119 KPPTKWPRKEPPAKVPEVPKKIVVEEKVRVPEEPRVPPTKVPEVLPPKEWPEKKVPVP 7178 



Qy 533 QEACESELNEATGTKIAYETKVDL VQTSEAIQESLYP 569 

II :: II :: I II : : : |:: I 

Db 7179 PAKKPEAPPPKVPEAP-KEWPEKKVPVPPPKKPEVPPTKVPEVPKAAVPEKKVPEAIPP 7237 

Qy 570 TAQLCPS — FEEAEATPS PV-LPDIVMEAPLNSLLPSAGASWQPSVSPLE 617 

: 1 | | | I : I I I I : I : : I : I : I : I I : I : : 

Db 7238 KPESPPPEVFEEPEESPSAPPKKPEVPPVRVPEVPKEWPEKKVPAAPPK— KPEVTPVK 7295 



Qy 



618 AP PPVSY DSIKLEPENPPP — YEE 639 

| || : : I : I I : I II : I I 

Db 7296 VPEAPKEWPEKKVPVPPPKKPEVPPTKVPEVPKVAVPEKKVPEAIPPKPESPPPEVFEE 7355 



Qy 640 AMNVALKALGTKEGIKEPESFNAA VQETEAPYISIACDLIKETKLSTEP 688 

Ml: I :: I I I II I I : I I : : I : : I 

Db 7356 PEEVALEE-PPAEWEEPEP — AAPPQVTVPPKNPVPEKKAPAV VAKKPELPPVK 7407 

Qy 689 SPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESL 748 

| : | : M I I M I I : I : I I I I I I : 

Db 7408 VPEVP — KEWP-EKKVP LWPKKPEAP PA KVP EVP KEW 7444 



Qy 749 TEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKD7VA.SNDIPTLTKKEKISLQMEE 808 

I I : I : I : I I I I : I : =11 : I : I : : I 

Db 7445 PEKKVAVPKKPEVPPAKVPEVPKKPVLEE-KPAVPVPERAESPPPEVYEEPEEIAPEEE- 7502 

Qy 809 FNTAIYSNDDLLSSKEDK IKESE TFSDSSPIEIIDEFPTFVSAKDDSPKLA 859 

: : : | : | : I I I : II I I I : : I 

Db 7503 IAPEEEKPVPVAEEEEPEVPPPAVPEEPKKIIPEKKVPVIKKPEAPP-P 7550 

Qy 860 KEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFK-NIYPKDEVHVSDEFSENRSSVSK 918 

|| : | : I : : II : I : I I : I I I I 

Db 7551 KEPEPEKVIEKPKLKPRPPPPPPAPPKEDVKEKIFQLKAIPKKKV PENPQVPEK 7604 

Qy 919 ASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLP — SDTEKEDRSLSAVL 967 

: : I I 1 hill I I : I : I I 

Db 7 605 VELTPLKVPGGE KKVRKLLPERKPEPKEEWLKSVL 7 640 



RESULT 8 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T26213
R;Ainscough, R.
submitted to the EMBL Data Library, August 1996
A;Reference number: Z20173
A;Accession: T26213
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-222 <WIL>
A;Cross-references: EMBL:Z78066; PIDN:CAB01523.1; GSPDB:GN00023; CESP:W06A7.3b
A;Experimental source: clone W06A7
C;Genetics:
A;Gene: CESP:W06A7.3b
A;Map position: 5
A;Introns: 27/1; 77/2; 201/2

Query Match 5.5%; Score 320; DB 2; Length 222;
Best Local Similarity 32.1%; Pred. No. 1e-06;
Matches 59; Conservative 47; Mismatches 78; Indels 0; Gaps 0;



Qy        976 VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 1035
              ::|::|||| II: :| :| :| 1 : :::| 1 1 1 ll-l 1 1
Db         27 ILDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTV 86

Qy       1036 QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVL 1095
              : | : | Ml I : : : : | 1 : : : 1 : 1 : : 1 1 : : : 1 : 1 1 : :
Db         87 KKTDSEHPFSEIIAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFGLV 146

Qy       1096 MWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI 1155
              : | | | : : I : I I I 1 1 1 : : 1 1 : 1 : 1 1 : 1 1 1 : 1 : : 1 : 1 1 :
Db        147 LWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDEKL 206

Qy       1156 PGLK 1159
              I I :
Db        207 PFLR 210



RESULT 9 
A47283 

calphotin - fruit fly (Drosophila melanogaster)
C;Species: Drosophila melanogaster
C;Date: 21-Sep-1993 #sequence_revision 25-Apr-1997 #text_change 21-Jul-2000
C;Accession: A47283
R;Ballinger, D.G.; Xue, N.; Harshman, K.D.
Proc. Natl. Acad. Sci. U.S.A. 90, 1536-1540, 1993
A;Title: A Drosophila photoreceptor cell-specific protein, calphotin, binds
calcium and contains a leucine zipper.
A;Reference number: A47283; MUID:93165730; PMID:8434015
A;Contents: photoreceptor cells
A;Accession: A47283
A;Status: preliminary
A;Molecule type: nucleic acid
A;Residues: 1-873 <BAL>
A;Cross-references: GB:L05080; NID:g157071; PIDN:AAA28420.1; PID:g157072
A;Note: sequence extracted from NCBI backbone (NCBIN:124958, NCBIP:124959)
C;Superfamily: collagen alpha 1(I) chain; fibrillar collagen carboxyl-terminal
homology; von Willebrand factor type C repeat homology

Query Match 5.2%; Score 304.5; DB 2; Length 873; 

Best Local Similarity 21.9%; Pred. No. 3.1e-05; 

Matches 213; Conservative 126; Mismatches 362; Indels 273; Gaps 42; 
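
The percentages in the summary lines above can be cross-checked by simple arithmetic. The Python sketch below uses hypothetical function names and rests on two assumptions: that Query Match is the hit score divided by the query's perfect score (taken here as 5848, the value the run header reports for this query), and that Best Local Similarity is Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). Both are inferences that happen to reproduce the printed values, not documented definitions.

```python
def query_match_percent(score, perfect_score):
    """Hit score as a percentage of the query's self-alignment (perfect) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Values copied from the summary lines of this result (A47283).
print(round(query_match_percent(304.5, 5848), 1))                 # 5.2
print(round(best_local_similarity_percent(213, 126, 362, 273), 1))  # 21.9
```

The same check applied to another hit (Score 320, Matches 59, Conservative 47, Mismatches 78, Indels 0) gives 5.5 and 32.1, matching the figures printed for that hit.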



Qy 62 AAGL S AAAVP P AAAAP L L D F S S D S VP PAP R — GPLPAAP — . — PAAP ERQPSWER 110

:| ::| I I Ml: I :| II Mill IN M :

Db 11 S APVAAP VT P S AVAAP VQWS PAAVAVAPAWAPAPAAPTAVT PVAP P PTLAS VQ PATVT 70



Qy 111 SPAAPAP SLPPAAAVLPSKLPEDDEPPARPPPPPP AGASPLAEPAAPP 158

Mill I : I I : I i : I I I II I : : I : I I I 

Db 71 VP-APAPIAAASVAPVASVAPPWAAPTPPAASPVSTPPVAVAQIPVAVSAPVAPPVAAT 129 

Qy 159 STPAAPKRRGSGSVDETLFALP — AASEP VI P S S AEKIMDLMEQ PGNT VSSGQ 209 

| | I I I : I I II I I I I : I IN I - 

Db 130 PTPVAPI PVAAPVIATPPVAASAPTPAAVTPWSPVIATPPWPANTTVPVAAPV 184

Qy 210 EDFPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERA 268 

| : : ■ I I I : : : I I : : : : I I I : I : I 

Db 185 AAVPAAVP WAPVLAPAVAPAVAP WAET PAP P P VAEI PVA — TIPECVAPLIPEVSWA 242 

Qy 269 TNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKEDLVCSAALH 328

|| I I : I I I : I 

Db 243 TKPLA AAE P VWAP PAT ET P WAP AAA 269

Qy 329 SPQESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDV 388 

III I : I : : : I I I I I 

Db 270 SPHVS VAP AVETAWAPV 287

Qy 389 LAARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCA 444 

I: I I | : | :| : I : | | I I :: : 

Db 288 SASTEPPVAAATLTTAPETPALAPWAESQVAANTWATPPTPAPEPETIAPPWAE 344

Qy 445 SFTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEA 504 

: Is : II:: :|: I I I 'II I : 



Db 345 TPEVASVAVAETTPPWPPVAAES 1 PAP WAT T P VP AT LAVT D P D - 389 

Qy 505 DYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQ 564

I I : I I : I : I I I I I I I I 
Db 390 VT AS AVP EL P P VI AP S P VP S A VAETPVDLA P 420 

Qy 565 ESLYP-TAQLCPSF— EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAP— 619

II |: I: II 11:1 : = I h :l : M :l III 

Db 421 P VL P PVAAEP VPAWAEET PET PAPASAPVT I - AALD — I P EVAPVI AAP S DAPAEAP S A 477

Qy 620 --PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD 677

Ml || hlllll::: 
Db 478 AAPIVS TPPT TAS VPETTAP PAAVPT E 504 

Qy 678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE— LVEDSSPESEPVDLFSD-DS 729

: : | : | : | : I : I : : I I I I : : I I : I I I 

Db 505 P I DVS VL S EAAI ET P VAP P VEVTT EVAVADVAP P EAAEDL 1 1 E P VE P PAP I P D L LEQTT S 564

Qy 730 IPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAA 789

: | | : : : : | | : I I I : : : I : I I : I : I : : 

Db 565 VPAVEAAESTSSPIPETSLPPPNEAVA SPEVAVAPITAPEPI PEP-EPSLATPTEPI 620 

Qy 790 SNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFV 849

: | : : i : : I : : I I I I I I 

Db 621 PVEAPV VI Q EAVD AVE VP VT ET S T S I P ET T VE F P EAV 657 

Qy 850 SAKDDSPKLAKEYTDLEV-SDKSEIANIQSGAD SLPCLELPCDLSFKNI YPKDEVH 904 

: | | : I : I : : : : I II I I : I : I : I 
Db 658 AEKVLDPAI TEAPVTTQEPDVANINDGAPATEITTPAVEI VT 699

Qy 905 VSDEFSENRSSVSKASISPSNVSALE-PQTEM— GSIVKSKSLTKEAEKKLPSDTEKEDR 961 

: | | : : : I : : I I : I I : : : I I I : I : : : 

Db 700 AAAEVS DT AI P LI D P P V- PQE I AVAE I P ET ET K P AEVI VEQ S - T I P I EAP VP EVS KYAE P 757

Qy 962 SLSAVLSAELSKTS 975 

:| :l i: |: 
Db 758 VISEAPAAEVPITA 771 



RESULT 10 
T24583 

hypothetical protein T06D8.1 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T24583
R;Palmer, S.
submitted to the EMBL Data Library, April 1995
A;Reference number: Z19909
A;Accession: T24583
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-1829 <WIL>
A;Cross-references: EMBL:Z49130; PIDN:CAA88964.1; GSPDB:GN00020; CESP:T06D8.1
A;Experimental source: clone T06D8
C;Genetics:
A;Gene: CESP:T06D8.1
A;Map position: 2
A;Introns: 1391/3; 1432/3; 1470/3; 1505/1; 1520/1; 1616/1; 1644/1; 1687/3;
1742/1



Query Match 5.2%; Score 302.5; DB 2; Length 1829; 

Best Local Similarity 21.1%; Pred. No. 0.00011; 

Matches 233; Conservative 157; Mismatches 469; Indels 245; Gaps 38; 

Qy 2 EDIDQSSLVSSSTDSPPR PPPAFKYQFVTEPEDEE DEEEE E 42 

I : : : | | | : I I I III II : I I I 

Db 212 EETTWAWESSGEEPASSSTSIPTELSKNDQVTEASGEETITAAATEASEETTTSAVTE 271 

Qy 43 DEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRG PLPAAPP 99

|| : : 1 : : | | : I : : : I : I I I I I 

Db 272 GSGEDTTWAWELSGEQPAS — SSTSIP TELSKDDQVTEASGEETTTAAATE 322 

Qy 100 AAPERQPS WERSPAAPAPSLPPAAAVLPSKLPEDDE-PPARPPPP 143 

| : I I II Ml : : | :: | : I I : I 

Db 323 ASEETTTSAVTEGSGEETTWAWESSGEEPASS STSIPTELSKDDQVTEASGEET 378 

Qy 144 P PAG AS P LAE P AAP P S T P AAP KRRGS GS VDET L FAL PAASEPVIPSSAEKIMD 196

I I : : I I : III I h h I I : I M : I 

Db 379 TTAAATEASE ETTTSAVTEGSGE-DTTWAWESSGEQPASSSTSI PTELSKDDQ 432

Qy 197 LMEQPGN TVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEH 237 

: | | III:: : : | : : I : I I : 
Db 433 VTEASGEETTTAAATEASEETTTSAVTEGSGEDTTVVAWESSGEQPASSSTSIPT 488

Qy 238 GYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPK 297 

I | : : | I I I I : I I I I | : : : : I I : 

Db 489 -ELSKDDQVTEASG— EETTTAAATEASEETTTSAVTEGSGEETTV-VAWESSGEEPAS 544 

Qy 298 GESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPV GKEDRWS PEKTMDI 350 

: : | I : I : : : : : I : I I : I : I : I M : : : I 

Db 545 SSTSIPTELSKDDKVTEASGEETTTAAATDASSEETTTSAVTEGSGEETTWAVVESSD- 603

Qy 351 FNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410

| | : I I : I : I I II I I I I: 

Db 604 -EEPASSSTSIPTELSKDDQVTEASGEETTT AAATEASEETTTSAVTEGSGEE 655 

Qy 411 KSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT 470 

:: | |: : I I I : I : I I I hill : h = 

Db 656 TTWAWESSGEEPASSSTSIPTELSKDDKVTEA SGEETTTAAATDASSEETTTSAV 712 

Qy 471 DEKKIEERKAQIITEKTSPK-TSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTP 529 

| || : | : : | : : : |: I I I : : I I I : II 

Db 713 TEGSGEETTWAWES SDEEPASS STS I PTELSKDDQVTEASGEETTTAAATEASEETTT 772

Qy 530 DLVQEACESELN EATG TKIAYETKVDLVQTSEAIQESLYPTAQLCPSF 577

|| I I :: I 11111:111 I = 

Db 773 SAVTEGSGEETTWAWESSGEEPASSSTSIPTELSKD-DQVTEASGEE— TTTAAATE 828 

Qy 578 EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPY 637 

| | I I : : ::: |:| lh I I :| 
Db 829 ASEETTTSAVTEGSGEDTTWAWESSGE QPASSSTSIPTELS 871 

Qy 638 EEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSE 697 

|: : h : I I : | : : | | I : 



Db 872 KDDQVTEASGEETTT AAATEASEETTTSAVTEGSGEETTV 911



Qy 698 IAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEV— PQTQEEAVMLMKESLTEVSET 754

: | | | | I I I I I : I I I : I I I I I : I 

Db 912 VAWESSGEEPA SSSTSIPTELSKDDQVTEASGEETTTAAATEASSEETTTSAVT 966 

Qy 755 VAQHKEERLSASPQELGKPYLESFQP NLHSTKDAASN 791 

:| I I : I: II : |:::::| 

Db 967 EGSGEETTTSAVTEGSGEETTTSAVPEGENSTTEAPAFVTGSEIEIPSSEESSSTTTHDP 1026 

Qy 792 DIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESET-FSDSSPIEIIDEFPTFVS 850 

II :| I :| :| : I : = I I : I I : I I I lh 

Db 1027 SIPVITPKPSVSSTIENVMSKTSSEE AAEKKI I GEHQTGKDDDAGKEDEDNMPAFVT 1083 

Qy 851 A KDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 891

| : I : : I : I I I : I : : : I
Db 1084 ANPAGTSTTESAENVTSTGEEDENIKMAKELGKQFAADLAKLA 1126

Qy 892 LSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 951 

| | | : : : | : : : I I : I I : I : : t I : : : I I : : 

Db 1127 AKDGVNLT- ETADAKDSGETAHVEDEQVS STE — S S I GSEETTTTVNKETTEE 1176

Qy 952 LPSDTEKEDRSLSAVLSAELSKTS 975 

: : I I : : I I I : 

Db 1177 HHEASGEEDDAPAFVTGAPTDSTT 1200 






RESULT 11 
A47282 

calcium-binding protein calphotin - fruit fly (Drosophila melanogaster)
C;Species: Drosophila melanogaster
C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 21-Jul-2000
C;Accession: A47282
R;Martin, J.H.; Benzer, S.; Rudnicka, M.; Miller, C.A.
Proc. Natl. Acad. Sci. U.S.A. 90, 1531-1535, 1993
A;Title: Calphotin: a Drosophila photoreceptor cell calcium-binding protein.
A;Reference number: A47282; MUID:93165729; PMID:8094559
A;Accession: A47282
A;Status: preliminary
A;Molecule type: nucleic acid
A;Residues: 1-865 <MAR>
A;Cross-references: GB:L02111; NID:g157031; PIDN:AAA28405.1; PID:g157032
A;Experimental source: photoreceptor cells
A;Note: sequence extracted from NCBI backbone (NCBIN:124955, NCBIP:124956)
C;Genetics:
A;Gene: FlyBase:Cpn
A;Cross-references: FlyBase:FBgn0010218
C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl-terminal
homology; von Willebrand factor type C repeat homology
C;Keywords: calcium binding

Query Match 5.1%; Score 295.5; DB 2; Length 865;

Best Local Similarity 21.3%; Pred. No. 7.2e-05;

Matches 217; Conservative 127; Mismatches 379; Indels 295; Gaps 41;

Qy 62 AAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPER QPSWERSPAAPAP 117 

: I : : I Mill: I : I I I I I : I I I I I : I I'll 



Db 11 SAPVAAPVTPSAVAAPVQWSPAAVAPAPAAPIAVTPVAPPPTLASVQPATVTIP-APAP 69



Qy 118 SLPPAAAVLPSKLPEDDEPPARPPPPPPAGA SPLAEPAAPPSTP 161 

I : | | : | | : III III = I : I I I I I 

Db 70 IAAASVTPVASVAPPWAAPTPPAASPVSTPVAVAQIPVAVSAPVAPPVAATPTPWQIP 129 

Qy 162 - AAP KRRG S G S VDET L FAL P — AAS E P VIPSSAEKIMDLMEQPGNT— VSSGQED 211 

Ml : I I I I I I 11:1 I I I I- 

Db 130 VAAP VIATPPVAASAPTPAAVTPVISPVIASPPWPANTTVPVAAPVAA 178

Qy 212 FPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATN 270 

|: : || |:::| I :: : : || | : |: II 

Db 179 VPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA — TI PECVAPLI PEVSWATK 236

Qy 271 PFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSP 330 

| I I : I I I : I I I 

Db 237 PLA AAE P VWAP P AT ET P WAPAAAS P 263 

Qy 331 QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLA 390 

| |:| :: :H III 
Db 264 HVS VAP AVETAWAPV 279

Qy 391 ARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCASF 446 

I: I I I : I :| = I : I II I :: 

Db 280 - S AS T E P PVAAAT LTT AP ET P ALAP WAE SQVAANTWAT P PT P AP E PET I AP P WAET P 338

Qy 447 TSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADY 506 

| : : ||:: : I : I I I : I I I : 
Db 339 EVASVAVAETTPPWPPVAAES 1 PAP WAT T P VP AT LAVT D P D 381 

Qy 507 VTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES 566 

I I : I I : I : I I I Mill 
Db 382 VTASAVPELPPVIAPSPVPSA VAETPVDLA PPV 414

Qy 567 LYP-TAQLCPSF— EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAP 619 

|| I : I : II I I : I : : | | : : I : I I : I I I I 

Db 415 L P P VAAE P VP AWAE ET P ET P AP AS AP VT 1 - AAL D — IP EVAP VI AAP S D AP AEAP S AAA 471 

Qy 620 PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD-- 677 

| M II 1 = 1 il II : 

Db 472 PIVS TPPT TASVPETTAPPAAVPTEPI 498 

Qy 678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE — LVEDSSPESEPVDLFSD-DSI P 731 

: : | : | : I : I : I : : I I I : : I' I : I I I : I 

Db 499 D VS VL S EAAI ET P VAP P VEVT T E VAVAD VAP P EAAAD L 1 1 E P VE P PAP IPDLLEQTT S VP 558 

Qy 732 EVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASN 791 

| : : : : I I : I II : : : I : I I : I : I : : 

Db 559 AVEAAESTSSPIPETSLPPPNEAVA SPEVAVAPITAPEPIPEP-EPSLATPTEPIPV 614 

Qy 792 DIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSA 851 

: | :: I : : I : : I I I I I I : 

Db 615 EAPV VI Q EAVDAVEVP VT ETSTSIPET T VE F P EAVAE 651 

Qy 852 KDDSPKLAKEYTDLEV-SDKSEIANIQSGAD SLPCLEL PCDLSFKNIYP- 899 

| | : I : I : : : : I I I I I : I : I : I : : I I 

Db 652 KVLDPAI TEAPVTTQEPDVANINDGAPATEITTPAVEIVTAAAEVSDIAI PVIDPP 707



Qy 900 -KDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMG - 936 

|: |: | I : :: : I : I I : 

Db 708 VPQEIAVA-EIPETDTKPAEVIVEQSTIPIEAPVPEVSKYAEPVISEAPAAEVPITAGDN 766 

Qy 937 SIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGV 990 

| : | : : I | I : I I : I : : | | | I :: I I 

Db 767 PDNTSVGISEWPTIAEKPVEEVPTSEIPEQSSSPSDSVPVAKITPLL--RDLQTTDV 822 



RESULT 12 
T34513 

hypothetical protein ZK783.1 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 29-Oct-1999
C;Accession: T34513
R;Favello, A.; Vaudin, M.
submitted to the EMBL Data Library, August 1994
A;Description: The sequence of C. elegans cosmid ZK783.
A;Reference number: Z21536
A;Accession: T34513
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-3507 <FAV>
A;Cross-references: EMBL:U13646; PIDN:AAC24418.1; GSPDB:GN00021; CESP:ZK783.1
A;Experimental source: strain Bristol N2; clone ZK783
C;Genetics:
A;Gene: CESP:ZK783.1
A;Map position: 3
A;Introns: 14/1; 48/2; 84/1; 196/3; 303/1; 381/1; 586/1; 605/1; 1175/3; 1207/1;
1409/2; 1450/1; 1760/1; 1898/1; 2153/1; 2628/1; 2681/1; 2899/1; 2946/1; 3088/3;
3184/1; 3283/1; 3346/3; 3365/3; 3484/3; 3504/1

Query Match 5.0%; Score 292; DB 2; Length 3507; 

Best Local Similarity 20.6%; Pred. No. 0.00072; 

Matches 226; Conservative 146; Mismatches 491; Indels 236; Gaps 37; 

Qy 11 SSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDL — EELEVLERKPAAGLSAA 68

| | : : : I | | : : I I : : t i I I : I I I : I

Db 2043 JSSSEAPLTSSPATTTEVITESSVKSTTPKEESSSEITVKLSSKSPEVTESSVKSSPSTP 2102

Qy 69 AVPPAAAAPLLDFSSDSVPPAPRGPLPAAPP AAPERQPSWERSPA APAPSL 119

: : : I I : I : : I : I : I I I : M

Db 2103 iTTSQSVTSTVPETSKSTVLSSEAPVTSTSPTEVHTSSETKPSLSASSTTGDTNSTTPST 2162

Qy 120 PPAAAVLPSKLPEDDEPPARPPPPPPAGASP-LAEP AAP P S T P AAP KRRGS G S V 172

I : I : I t I I : I I : :: I I III: M

Db 2163

Qy 173 - LFALPAAS EP VI -PS SAEKIMDLMEQPGNTVS S GQEDFP SV 215

: I : : I M II : : III : I III I I

Db 2221

Qy 216

: I I : I I I I : I I | | : : : I : I I II

Db 2281



Qy 271 PFV NRDLAEFSEL EYSEMGSSFKGSPKG ESAI LVENTKEEVIVRSKD 317 

| : : | : : I : : I I : I I I I : : : I I : 

Db 2341 PDILSTTTNNLSQSSTVSTEDRSEISSENSEKPTSAPELVTSSVTHVASSSPDVPTESSE 2400 

Qy 318 KEDLVCSAALHSPQ ESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFK 370

: | | | : : I : I : III : | : I I : 

Db 2401 PDDLTGSSTENIPEASSKQTISSTPTPDTTTASEEPTKSTSMSPDLSTTSNVLSESSTTP 2460

Qy 371 PFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTP 430

| : M : : I I I : I I I I : : I I 

Db 2461 ESSSKSPVSSSTEG— ISWTSTEFSKVPESTISSVLE EDLT-KTTP 2504

Qy 431 E P VKD S S RAY I T CAS FT SAT ESTTANTFPLLEDHTSENKTDEKKIEE 477

| : : : I I I I I III:: I : : : : I : 

Db 2505 SPILEET TTASETSEPLTEDSLTVSVRIHELTTSSENVPKESESTTTSSESSKPSQ 2560 



Qy 478 RKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGL — - — TPDLV 532 

I I : I II: I : : I I : I : : I I : : 1=1 : I I 

Db 2561 EPAGILTSTVWPTSSVSLITASEIEA-- ITSNTPFKQGRTPITTSPKSLVKSTTSPSTV 2618 

Qy 533 --QEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPD 590 

IN | : : | : I I I : : : I : I : I : : : I I 

Db 2619 TSSEPSESTKRTTVSTTVSTTTPTEETTTSESLILTAAPSK PTESTTESSEAPTTP- 2674 

Qy 591 IVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGT 650 

I I : I I ||::: 
Db 2675 AKTSETKPS NVSSTSRKS 2692 

Qy 651 KEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE 710 

| : : | : : : : : III I : : : I I I 

Db 2693 TENVETSTSQSGSLESS TMSSTSSEPETNAPAVTVSSEASSTTLEE 2738 

Qy 711 LVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKE-ERLSASPQE 769

| | | | : || Ml: : I : : : I : I : I : I I 

Db 2739 NSSTSSPTSSEASVKLSSLFPE— SITSEAVTVSSRAPAEITMSSESHREISTVSSEPSE 2796 

Qy 770 LGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSS 822 

I : I I : : I : I I : I I : I I I : I 

Db 2797 PEIPLSTTVSPNWTASSIPSEE-PILSSVTSSSTPRVRLITG— TPDDLIVSVTVPSHG 2853 

Qy 823 -KEDKIKESETFSDS-SPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGA 880 

: II I : I I II : I I : I I I I I 

Db 2854 NRRQNITASSVPSNSTSPIILPSESLTTPQPPPTTTTTAKPAT TSGK 2900 

Qy 881 DSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVK 940

I : : I : : : I III I 
Db 2901 RGPPSIQPPAEM -FTTPAPPPPSNGGYGEE 2929 

Qy 941 SKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLL 1000 

I : I : : : I I I I I : : : I : IN I : 

Db 2930 TNQEEEQVTSTTTTEAPSLCSTVTCHSLATCE QSTGVCICRDGFIGD 2976 

Qy 1001 SLTVFSIVSVTAYIALALL 1019 

III I : I I 

Db 2977 GTTACSKKSTADCI SLPSL 2995 



RESULT 13 
A56577 

microtubule-associated protein MAP 1B - rat (fragment)
C;Species: Rattus norvegicus (Norway rat)
C;Date: 21-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 16-Feb-1997
C;Accession: A56577
R;Zauner, W.; Kratz, J.; Staunton, J.; Feick, P.; Wiche, G.
Eur. J. Cell Biol. 57, 66-74, 1992
A;Title: Identification of two distinct microtubule binding domains on
recombinant rat MAP 1B.
A;Reference number: A56577; MUID:92347374; PMID:1639092
A;Accession: A56577
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-2364 <ZAU>
A;Cross-references: GB:X60550
A;Experimental source: brain
A;Note: nucleotide sequence not given; conceptual translation not complete
C;Superfamily: microtubule-associated protein MAP 1B

Query Match 5.0%; Score 291.5; DB 2; Length 2364; 

Best Local Similarity 20.0%; Pred. No. 0.00043; 

Matches 220; Conservative 167; Mismatches 456; Indels 255; Gaps 44; 

Qy 30 TEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPA 89 

: I I I I : I : : II |:| : :: I I II III:: I : 
Db 913 SEEEGEEEEDKAEDAREEDHEPDKTE AEDYVMAWDKAAEAGVTEDQYDFL 963

Qy 90 PRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGAS 149 

|:| I : II II::: II II I I 

Db 964 GTPAKQ PGVQSPSREPASSIHDETLPGGSESEAT AS 999

Qy 150 PLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGN TVS 206 

| | M I : : II I I : I I I I 

Db 1000 DEENREDQPEEFTAT SGYTQST IEI SSEPTPMDEMSTPRDVMTDETNNEETES 1052 

Qy 207 SGQE DFPSVLLETAASLP SLSPLS TVS FKEHG YLGNL S AVS S S EGT I E 254 

II : I I 111:11 I : I : I : I I 

Db 1053 PSQEFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTI SPPSSMEE 1112 

Qy 255 ETLNEAS KELPERATNPFVNRDLAEF--SELEYSEMG 289

: : : : : I : : : : I : I : I : I : : I 

Db 1113 DKFSKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERS 1172

Qy 290 SSFKGSPKGESAI LVENTKEEVIVR SKDKEDL VCSAALHSP 330 

: | | : | I : : I : : I I : : : I I : I I : I 

Db 1173 WFSLTPNEI KASAEGEATAVVS PGWQAVVEEHCAS PEEKTLEWS PSQS VTGSAGHT P 1232 

Qy 331 -QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFK PFEQAWEVKD 380 

: | | :: : I I : : I I I : : I I :: I I 

Db 1233 YYQSPTDEKSSHLPTEVT ENAQAVPVS FE FT EAKDENERS SIS PMDE- - PVPD 1283 

Qy 381 TYEGSRDVLAARANVESKVDRKCLED — SLEQKSLGKDS EGRNEDASFPSTPEPVK 434 

: ||: : I I I : I : II : I 11 = 1 I M 

Db 1284 SESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSDKESPVS 1343 



Qy 435 DSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT DEKKI 475

| : | : I I i : i I : I I I I : t :

Db 1344 D LTSDLYQDKQEEKRAGFIPIKEDFSPEKKASDAEIMSSQSALALDERKLGGDGS 1398 

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAV 520 

|: | | I I:: I :|: I : :: I: hi 
Db 1399 PTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT Y S HMEGVAS VS T AS VAT 1455 

Qy 521 SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES-LYPTAQLCP S 576 

|: II I I I : :|: h: I =111 lh : h : II I 

Db 1456 SSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMS 1514 

577 FEEAEATP SPVLPDIVMEAPLNSLL P SAGAS WQ 610 

: : | I : : h I : I I h I I : : 

Db 1515 ISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHPTVGAGMLH 1574 

611 PSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFN 661 

: : III I h I ||:::::: : I : : 

Db 1575 ITEN GPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLS-ELISVSQVEASPSTSS 1630 

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721 

| II I ::| I I I h :l : 1*1 Ihh 
Db 1631 AHTPS QIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKL SPKSDI 1680 

Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQP 780 

| : | I : : I I I I : I : : : I I I = : I 

Db 1681 SPLTPRESSPTYSPGFSDSTSGAKES-TAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQH 1739 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

: | : : | : : : : I : : I I I : I : : : : I I h 

Db 1740 HLALSRDLTTSSV EKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYE SHEKTIQ 1793 

Qy 841 IIDEFPTFVSAKDDSPK LAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

| : : : | I : I : I : : I I h:: I 

Db 1794 AHDVGGYYYEKTERTIKSPCDSGYSYETIEKTTKTP EDGGYS CEITEKT 1842 

Qy 897 IYPKDEVHVSDEFSENRSSVSKAS ISPSNVSALEPQTEMGSIVKSKSL 944 

: | | | | | : : I : : : : I : I 

Db 1843 TRTPEEGGYSYEISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSY 1902 

Qy 945 TKEAEKKLPSDTEKEDRS 962 

: I : I : I II I 

Db 1903 SYETTEKITSFPESESYS 1920 



RESULT 14 
T19431 

hypothetical protein C25A1.10 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 21-Jan-2000
C;Accession: T19431
R;Mortimore, B.
submitted to the EMBL Data Library, October 1996
A;Reference number: Z19124
A;Accession: T19431
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-971 <WIL>
A;Cross-references: EMBL:Z81038; PIDN:CAB02755.1; GSPDB:GN00019; CESP:C25A1.10
A;Experimental source: clone C25A1
C;Genetics:
A;Gene: CESP:C25A1.10
A;Map position: 1
A;Introns: 38/3; 92/3; 201/3; 919/3
C;Superfamily: nucleolus-cytoplasm shuttle phosphoprotein

Query Match 4.9%; Score 288.5; DB 2; Length 971; 

Best Local Similarity 20.6%; Pred. No. 0.00016; 

Matches 198; Conservative 127; Mismatches 391; Indels 247; Gaps 35; 

Qy 7 SSLVSSSTDSPPRPPPAFKYQFVTEP EDEEDEEEEEDEEEDDEDLEELEVLERKPAA 63

Mllllhll : :| : : : : : |: II :: :: I I 
Db 147 SSSDSDSDDEPPKKAPAVTTKVAPKPMAKKQDTSDSDSDSEDSDDGKSKKANPVKVTPVA 206 

Qy 64 GLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAP ERQPSWE 109

: | ||: III I I : I I I I I I : : I : 

Db 207 NVLQKWAKKAASSSSDSSDDEKKPAAK-PTPAKPTPKPWKKAESSSDSSDDEKKPVAK 265 

Qy 110 RSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGS 169

:|| III I Mill: || I : I : I | : ::: 

Db 266 PAPAKATPK- PAAKKADS S S DS S DDEAPAK-KT PAKAAPKPVAKKAES S SDS SDDEKK 321 

Qy 170 GSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPL 229

Ml |: |: : : : I :| : : | : : I : 

Db 322 PAAK PTPAKATPKPVAKKAESSSDSSDDEKKPVAKPAPAKATPKPV 367 

Qy 230 STVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSE— 287

: | |||: : :| I : I I : I I I : h I 

Db 368 AK KAESS SDS SDDEK-KPAAKPTPAKATPKPVAKKAESS SDS SDDEKK 414

Qy 288 MGS S FKGS PKGESAI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PVGKEDRWS P 344

: I I : I I =M 1:= I 1:1 I = 1 

Db 415 PVAKPTSAKATPK PAAKKAD- — S S SDS S DDEAPAKKT PAKAAP 455 

Qy 345 E KTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRD VLAARANV 395

: | : : : : I I I :| : I I I I 

Db 456 KPASKKAESSSDSSDDEKPAAKSTPAKITPKPTAKKVASSSSDSSDDEKKPAAKPTPANA 515 

Qy 396 ESKVDRKCLEDSL EQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT SAT 450 

I I I I |:| : I : : Hhl : : 

Db 516 TPKPVAKKAESSSDSSDDEKKPVAKPTSAK ATPKPAAKKADLSSDFSDDEAPA 568 

Qy 451 ESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPK-TSNPFLVAVQDSEADYVTT 509

: | | I :|: :| I: |: 11=11 I = = M I 

Db 569 KKTPAKAAPKPASKKAESSSDSSDDEKPAAKSTPAKTTPKPTAKKAASSSSDSSDD— EK 626

Qy 510 DTLSKVTEAAVSNMPEGLTPDLVQEACESELN EATGTKIAYETKVDLVQTSEAI 563

: : I I I : I I :: : I : I M = IN :l== 

Db 627 KPVAKPTSAKATPKPAAKKADS S SDS S DDEAPAKKT PVKPTPVKI VAK- KVD S S S DS S 683 

Qy 564 QE S L Y PT AQLC P S FE EAEAT P S P VL P D I VMEAP LN S LL P S AGAS WQ P S VS P LEAP P P VS 623

: || :M1 II I: I II I 
Db 684 DDEKKPT KATPVKVTP KSVTKKAAAS S 710 

Qy 624 YDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETK 683 



M : I I : : I I I : I I : = II
Db 711 S D S — S DDEKKPWKQT PNW PKKEKAASSSDDSS DDEKK 748



Qy 684 LSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVML 743 

: : | : | I : I : : I II : I I I I I 
Db 749 PTAKPTP KATPKQSAKKADSSDDS SDDEAPA 779



Qy 744 MKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKIS 803

I 

Db 780 KKT PAK S T P AKT AVKKEAS S 799

Qy 804 LQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS PKLAKE 861

| : | | : : I I : I : I : I : : : I : I I M : I 

Db 800 SSDDSSDDEKTKKKSATTPAKSTPKTALKKAESSDSSDDDEDLPKPSKA 848 

Qy 862 YTDL-EVSDKSEIANIQSGADSLPCLEL-PCDLSFKNIYPKDEVHVSDEFSENRSSVSKA 919 

| : : | | | : : I I : I I : :: II I " I 

Db 849 WPRPQRADSEESAETEESSSRTPALKAKPLATSTEKAVYENRKRKSSPF RRVQMTKD 906 

Qy 920 SIS 922 

I : I 

Db 907 SVS 909 



RESULT 15 
QRMSP1 

microtubule-associated protein MAP1B - mouse
N;Alternate names: microtubule-associated protein MAP1(X); microtubule-associated
protein MAP1.2; microtubule-associated protein MAP5
C;Species: Mus musculus (house mouse)
C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 01-Sep-2000
C;Accession: S07549; S44387; A33645
R;Noble, M.; Lewis, S.A.; Cowan, N.J.
J. Cell Biol. 109, 3367-3376, 1989
A;Title: The microtubule binding domain of microtubule-associated protein MAP 1B
contains a repeated sequence motif unrelated to that of MAP 2 and tau.
A;Reference number: A33645; MUID:90094539; PMID:2480963
A;Accession: S07549
A;Molecule type: mRNA
A;Residues: 1-2464 <NOB>
A;Cross-references: EMBL:X51396; NID:g52999; PIDN:CAA35761.1; PID:g53000
R;Sanchez, C; Padilla, R.; Paciucci, R.; Zabala, J.C.; Avila, J.
Arch. Biochem. Biophys. 310, 428-432, 1994
A;Title: Binding of heat-shock protein 70 (hsp70) to tubulin.
A;Reference number: S44387; MUID:94234720; PMID:8179328
A;Accession: S44387
A;Status: preliminary
A;Molecule type: protein
A;Residues: 653-663, 'IC* <SAN>
C;Superfamily: microtubule-associated protein MAP 1B
C;Keywords: microtubule binding; phosphoprotein; tandem repeat
F;589-786/Domain: microtubule binding #status experimental <MTB>
F;589-592,639-642,649-652,655-658,660-663,668-671,674-677,679-682,683-686,687-690,
691-694,695-698,699-702,708-711,712-715,716-719,720-723,727-730,758-761,764-767,
783-786/Region: 4-residue repeats (K/R-K-E/D-X)
F;1861-2064/Region: 17-residue repeats
F;91,116,351,888,1124,1153,1168,1208,1662,1877,1918,2003,2030,2054,2083/Binding
site: phosphate (Ser) (covalent) #status predicted
F;147,969,1336,1562,1563,1702,1708,1990,2057,2063,2419/Binding site: phosphate
(Thr) (covalent) #status predicted
F;1953/Binding site: phosphate (Tyr) (covalent) #status predicted

Query Match 4.9%; Score 284; DB 1; Length 2464;

Best Local Similarity 20.7%; Pred. No. 0.00093; 

Matches 233; Conservative 159; Mismatches 424; Indels 312; Gaps 51; 

Qy 31 EPEDEEDEEEEEDEEED DEDLEELE VLERKPAAG LSAAAVP 71 

111:11111:11 : I I I I : = M I : I 

Db 1009 EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVKAVADKAAEAGVTEEQYGYLGTSAK^ 1068 

Qy 72 PAAAAPLLDFSS DSVPPAPRGPLPAAPPAAPERQP SWERSPAAPA 116 

| : | : : I :::| M Ml I : ' 

Db 1069 PGIQSPSREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEISSEPT 1128 

Qy 117 P SLPPAAAVLPSKLPEDDEPPARP PPPPPAGASPLAEPA— 155

| I : I: I I:: | | : |:| :

Db 1129 PMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPAVAS FNGLSEGSKT 1188

Qy 156 APPSTPAAPKRRGSGSVDETLFALPA AS EPVI P S S AEKIMDLMEQ 200

| M : I I :: I M I : I —Ml : I : : 

Db 1189 DATDGKDYNASASTISPP SSMEEDKFSKSALRDAYCSEEKELKASAE LDIKDV 1241 

Qy 201 PGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEA 260 

: I I I I I 1 I I I M I 
Db 1242 SDERLS PAKSPSLSP SPPSPIEKT 1265 

Qy 261 SKELPERATNPFVNRDLAEFS ELEYSEMGSS FKGS PKGESAI LVEN TKEEVI VR 314 

|||:| II |:: I I : II |:: |: : II : 

Db 1266 — PLGERSVN FS LTPNEI KVSAEGEARSVS PG VTQAWEEHCAS PEEKTLE 1314 

Qy 315 SKDKEDLVCSAALHS P-QESPVGKEDRWS PEKTMDI FNEMQMS WA- PVREEYADFK- - 370

| :| |:| :|| : ||: : |: : I II |::: I 

Db 1315 WSPSQSVTGSAGHTPYYQSP TDEKSSHLPTEVSENAQAVPVSFEFSEAKDE 1366 

Qy 371 PFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED— SLEQKSLGKDS 417

| :: I I: IM : II I : I IM I 

Db 1367 NERASLSPMDE — PVPDSESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGRRSESPF 1424 

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT 470 

||:| II Ml: : : I : M II I II 

Db 1425 EGKNGKQGFPDRESPVSDLT STGLYQDKQEEKSTGFIPIKEDFGPEKKTSDVETMS 1480 



Qy 471 DEKKI EERKAQIITEKTSPKTSNPFLVAVQDSEAD 505

||:|: I: I I I I : : I I M

Db 1481 SQSALALDERKLGGDVSPTQIDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGV— AEDT 1538

Qy 506 YVTTDTLSKVTEAAV--SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAI 563

| : :: |: |:| |: II III : : I : I" I MM 

Db 1539 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 1597 

Qy 564 QES-LYPTAQLCP— SFEEAEATP S PVLPDIVMEAPLNSLL 601 

M : : I : : M I : M I : : Ml M 

Db 1598 QETEMSPSKEECPRPMSISPPDFSPKTAKSRT PVQDHRSEQSSMSIEFGQESPEHSFAMD 1657 



Qy 602 PSAGASWQPSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVA 644

i , iiii. . . 1 || 11*1 11: : : :
I : I 1 1 1 . . i ii ii*i t i •

Db 1658 FSRQSPDHPTLGASVLHITEN GPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLS 1714

Qy 645 LKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKS 704

. . . i . . i ii I - • I 1 • 1 1 • : 1
: : : I : - I 1 1 1 - * I 1 • 1 1 • • i

Db 1715 -ELISVSQVEASPSTSSAHTPS QIASPLQEDTLSDWPPREMSLYASLA 1762

Qy 705 VPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVML MKE S LT EVS ETVAQH KE 760

.it. ill- 1 1 1 t 1 1 • E * 1 (ill
: 1 1 : III: Ml II 1 ■ 1 • 1 • nil

Db 1763 SEKVQ — SLEGEKLSPKSDIS PLTPRESSPLYSPGFSDSTSAAKETAAAH — 1810

Qy 761 ERLSASP QELGKPY LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFN 810

i . i i .ii * • i - I -l • • • 1 ■ * 1 1
: | : | | -11 . . l . | . | . . . . | . .m

Db 1811 - QAS S S P P I DAAT AE P YG FRS SML FDTMQHHLALN RDLTT S S V EKDSGGKTPGDFN 1865

Qy 811 TAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPK LAKEYTDLE 866

i . i .... i i i i • 1 1 ■ 1
| : : | : : : : | 1 1 1 • • ■ 1 1 • l

Db 1866 YAYQKPENAAGS PDEEDYDYE- - SQEKTI RTHDWRYYYEKTERTI KS PCDSGYS YET I E 1923

Qy 867 VSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKAS 920

: |: : I |::: 1 :| MM' : 1

Db 1924 KTTKTP- EDGG YTCEITEKTTRTPEEGGYSYEISEKTTRTPEVSGYTYEK 1972

Qy 921 ISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 962

: : : : 1 : 1 : 1 : 1 : 1 1 1 1

Db 1973 TERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESESYS 2020





Search completed: September 29, 2004, 18:15:50 
Job time : 36.465 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 29, 2004, 18:16:15; Search time 124.84 Seconds

(without alignments)
2997.869 Million cell updates/sec

Title: US-09-830-972-2 
Perfect score: 5848 

Sequence: 1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKIPGLKRKAD 1163 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1351062 seqs, 321799191 residues 

Total number of hits satisfying chosen parameters: 1351062 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
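
The parameters above name a Smith-Waterman ("sw") model with Gapop 10.0 and Gapext 0.5. As a rough illustration only, the Python sketch below computes a local alignment score with affine gap penalties using the usual three-state recurrence. Everything named here is an assumption for illustration: local_alignment_score and toy_score are hypothetical names, toy_score (+5 identity, -4 otherwise) merely stands in for the BLOSUM62 table, and the report does not state GenCore's exact gap-cost convention, so a gap of length k is assumed to cost gap_open + k * gap_extend.

```python
def toy_score(a, b):
    """Hypothetical stand-in for BLOSUM62: +5 for identity, -4 otherwise."""
    return 5.0 if a == b else -4.0


def local_alignment_score(query, target, score_fn=toy_score,
                          gap_open=10.0, gap_extend=0.5):
    """Best local alignment score with affine gaps.

    Assumption: a gap of length k costs gap_open + k * gap_extend.
    """
    neg_inf = float("-inf")
    cols = len(target) + 1
    h_prev = [0.0] * cols       # best scores for the previous query row
    f_prev = [neg_inf] * cols   # previous row, alignments ending in a gap that skips query residues
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * cols
        f_cur = [neg_inf] * cols
        e = neg_inf             # current row, alignments ending in a gap that skips target residues
        for j in range(1, cols):
            # Open a new gap from the best score, or extend the gap already open.
            e = max(h_cur[j - 1] - gap_open - gap_extend, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open - gap_extend,
                           f_prev[j] - gap_extend)
            diag = h_prev[j - 1] + score_fn(query[i - 1], target[j - 1])
            # Local alignment: never let the score drop below zero.
            h_cur[j] = max(0.0, diag, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


print(local_alignment_score("PGLK", "XXPGLKXX"))  # 20.0
print(local_alignment_score("ACDEFGHIKLMNPQRSTVWY",
                            "ACDEFGHIKLGGGMNPQRSTVWY"))  # 88.5
```

In the second call, bridging the three inserted residues costs 10.0 + 3 * 0.5 = 11.5 under the assumed convention, so the 20 identities score 100 - 11.5 = 88.5. Only two rows of the matrix are kept because the score alone is needed; recovering the alignment itself would require a traceback.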



Database : Published_Applications_AA:*

1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed 
and is derived by analysis of the total score distribution. 
Result           Query
   No.  Score  Match %  Length  DB  ID                   Description
     1                                                   Sequence 18, Appl
     2                                                   Sequence 431, App
     3                                                   Sequence 6, Appli
     4                                                   Sequence 6, Appli
     5                                                   Sequence 71, Appl
     6                                                   Sequence 2, Appli
     7                                                   Sequence 23, Appl
     8                                                   Sequence 429, App
     9                                                   Sequence 9, Appli
    10                                                   Sequence 9, Appli
    11                                                   Sequence 7, Appli
    12                                                   Sequence 164, App
    13                                                   Sequence 20, Appl
    14                                                   Sequence 6, Appli
    15                                                   Sequence 6, Appli
    16                                                   Sequence 24, Appl
    17                                                   Sequence 8, Appli
    18                                                   Sequence 72, Appl
    19                                                   Sequence 4, Appli
    20                                                   Sequence 21, Appl
    21                                                   Sequence 25, Appl
    22                                                   Sequence 1, Appli
    23                                                   Sequence 9, Appli
    24                                                   Sequence 467, App
    25                                                   Sequence 4, Appli
    26                                                   Sequence 11, Appl
    27                                                   Sequence 93, Appl
    28                              US-10-267-502-432    Sequence 432, App
    29  789.5     13.5     776  12  US-10-660-946-5      Sequence 5, Appli
    30  789.5     13.5     116  12  US-10-267-502-430    Sequence 430, App
    31  716.5     12.3     356  12  US-10-660-946-6      Sequence 6, Appli
    32  706.5     12.1     593  15  US-10-108-260A-2892  Sequence 2892, Ap
    33    685     11.7     208  12  US-10-660-946-7      Sequence 7, Appli
    34    671     11.5     267  12  US-10-660-946-8      Sequence 8, Appli
    35    671     11.5     267  14  US-10-205-194-127    Sequence 127, App
    36    630     10.8     266  12  US-10-276-774-2330   Sequence 2330, Ap
    37  625.5     10.7     236   9  US-09-729-674-20     Sequence 20, Appl
    38  625.5     10.7     236   9  US-09-765-205-26     Sequence 26, Appl
    39  625.5     10.7     236  12  US-10-408-967-2      Sequence 2, Appli
    40  625.5     10.7     269  14  US-10-106-698-6222   Sequence 6222, Ap
    41  539.5      9.2     168  10  US-09-809-391-563    Sequence 563, App
    42  539.5      9.2     168  10  US-09-882-171-563    Sequence 563, App
    43  539.5      9.2     168  12  US-10-164-861-563    Sequence 563, App
    44    520      8.9     222  12  US-10-267-502-428    Sequence 428, App
    45    519      8.9     241  12  US-10-660-946-3      Sequence 3, Appli



ALIGNMENTS 



RESULT 1 

US-09-893-348-18
; Sequence 18, Application US/09893348
; Patent No. US20020072493A1
; GENERAL INFORMATION:
; APPLICANT: EISENBACH-SCHWARTZ, Michal
; APPLICANT: COHEN, Irun R.
; APPLICANT: BESERMAN, Pierre
; APPLICANT: MOSONEGO, Alon
; APPLICANT: MOALEM, Gila
; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND THEIR USES
; FILE REFERENCE: EIS-SCHWARTZ=2A
; CURRENT APPLICATION NUMBER: US/09/893,348
; CURRENT FILING DATE: 2001-06-28
; PRIOR APPLICATION NUMBER: US 09/314,161
; PRIOR FILING DATE: 1999-05-19
; PRIOR APPLICATION NUMBER: US 09/218,277
; PRIOR FILING DATE: 1998-12-22
; PRIOR APPLICATION NUMBER: PCT/US98/14715
; PRIOR FILING DATE: 1998-07-21
; PRIOR APPLICATION NUMBER: IL 124500
; PRIOR FILING DATE: 1998-05-19
; NUMBER OF SEQ ID NOS: 29
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 18
; LENGTH: 1163
; TYPE: PRT
; ORGANISM: Rattus norvegicus
US-09-893-348-18

Query Match 100.0%; Score 5848; DB 9; Length 1163;
Best Local Similarity 100.0%; Pred. No. 6.1e-287;
Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163





RESULT 2 

US-10-267-502-431
; Sequence 431, Application US/10267502
; Publication No. US20040071700A1
; GENERAL INFORMATION:
; APPLICANT: Kim, Jaeseob
; APPLICANT: Galant, Ron
; TITLE OF INVENTION: Obesity Linked Genes
; FILE REFERENCE: LSD-07416
; CURRENT APPLICATION NUMBER: US/10/267,502
; CURRENT FILING DATE: 2003-01-27
; NUMBER OF SEQ ID NOS: 439
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 431
; LENGTH: 1163
; TYPE: PRT
; ORGANISM: Mus musculus
US-10-267-502-431

Query Match 90.7%; Score 5307; DB 12; Length 1163;
Best Local Similarity 91.1%; Pred. No. 1.3e-259;
Matches 1066; Conservative 34; Mismatches 56; Indels 14; Gaps 7;

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||| |||||||||||||||||||||||||:|| |||||||||||||||||
Db    1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        |||||||  ||| |||||||||||||||||||||||||| |||||||||||||| |||||
Db   60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
        |||||||||||||||||||  || ||||||||||||||||||||||||||||||||||||
Db  119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
        ||||||||||||||||| |||||||||||||||||| |||||||||||||||||||||||
Db  177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236

Qy  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
        ||||||:|:||||||||||||:|||||||||||||: |||| |||||||||| |||||||
Db  237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296

Qy  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESP-----VGKEDRVVSPEKTMDIFNEMQ 355
        |:||||||||||||||||||||||||||:|||||     | ||| |:||||||||||||:
Db  297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMK 356

Qy  356 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415
        |||||||||||||||||||||||||||||||||||||||:|||||:|| |||||||| ||
Db  357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKSHGK 416

Qy  416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475
        ||| |||:|||||||| ||| ||||||| ||||||||| || ||:|||||||||||||||
Db  417 DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTAANIFPVLEDHTSENKTDEKKI 476

Qy  476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535
        |||||||||||||||||||||||: |||||||||| ||||||| |: |||||||||||||
Db  477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAVVATMPEGLTPDLVQEA 536

Qy  536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595
        |||||||||||||||||||||||||||||||:||||||||||||||||||||||||||||
Db  537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596

Qy  596 PLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655
        |||||||| ||||| ||| |||| | ||||| |||||||||||||||:||||    || ||
Db  597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 656

Qy  656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715
        |||||||| || ||||||||||||||||||||||| |||||||||||||||:| |||:||
Db  657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 716

Qy  716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 774
        |||||||||||||||||||||||||||||||||||||||| ||| :|||||||||:||||
Db  717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776

Qy  775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834
        ||||||||| |||||||:|||||||| |||||||||||||||||||||||||:|||||||
Db  777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836

Qy  835 DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 894
        ||||||||||||||||||||||   |||||||||:||||||:||||:|||| ||||||||
Db  837 DSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 893

Qy  895 KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 954
        || ||||| |||||||::||||||  :   |||||| | |||:||| | ||||||:||||
Db  894 KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPS 953

Qy  955 DTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAY 1013
        |||||||||:|||||||:||||||||||||||||||| ||||||||||||||||||||||
Db  954 DTEKEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAY 1013

Qy 1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073

Qy 1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQI 1133
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db 1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQI 1133

Qy 1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||||||||||||||||:
Db 1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAE 1163
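
The percentages printed in each result header can be cross-checked against the raw counts. The short Python sketch below assumes, based only on the figures printed in this report and not on any documented definition, that Query Match is the hit score divided by the perfect score, and that Best Local Similarity is the number of matches divided by the total number of alignment columns (matches + conservative + mismatches + indels). The function name alignment_percentages is hypothetical.

```python
def alignment_percentages(score, perfect_score, matches,
                          conservative, mismatches, indels):
    """Return (query_match_pct, best_local_similarity_pct), rounded to 0.1.

    Assumed relations (inferred from the report's own numbers):
      query match           = score / perfect score
      best local similarity = matches / alignment columns
    """
    # Every alignment column is a match, a conservative substitution,
    # a mismatch, or an indel.
    columns = matches + conservative + mismatches + indels
    query_match = 100.0 * score / perfect_score
    best_local_similarity = 100.0 * matches / columns
    return round(query_match, 1), round(best_local_similarity, 1)


# RESULT 2 of this report: Score 5307 against a perfect score of 5848.
result_2 = alignment_percentages(5307, 5848, 1066, 34, 56, 14)
print(result_2)
```

Under these assumptions the RESULT 2 counts reproduce the printed 90.7% and 91.1%, and the RESULT 3 counts (score 4403.5; 909, 104, 145, 39) reproduce 75.3% and 75.9%.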



RESULT 3 
US-09-758-140-6
; Sequence 6, Application US/09758140
; Patent No. US20020012965A1
; GENERAL INFORMATION:
; APPLICANT: Strittmatter, Stephen M.
; TITLE OF INVENTION: Nogo Receptor-Mediated Blockade of Axonal Growth
; FILE REFERENCE: 44574-5073-US
; CURRENT APPLICATION NUMBER: US/09/758,140
; CURRENT FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: US 60/175,707
; PRIOR FILING DATE: 2000-01-12
; PRIOR APPLICATION NUMBER: US 60/207,366
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: US 60/236,378
; PRIOR FILING DATE: 2000-09-29
; NUMBER OF SEQ ID NOS: 20
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 6
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-758-140-6

Query Match 75.3%; Score 4403.5; DB 9; Length 1192;
Best Local Similarity 75.9%; Pred. No. 5.9e-214;
Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20;

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        |||:||| ||||| |||||| ||||||||| |||||| |||||:||::||||||||||||
Db    1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy   61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
        |||||||| ||  ||| |||:|| :| ||||||||||||||| ||||||||: || :   |
Db   59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy  116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
        |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db  119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy  167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225
        || ||||||||||||||||||| ||||  ||| ||||||:|:||||||||||||||||||
Db  179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy  226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285
        |||||  ||||| |||||| |  :|||::| ::|||||: |:|    ::||| |||||||
Db  238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy  286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339
        |||||||  ||| |||::| | :||:||::||:|: || :  ||: || |       |||
Db  298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy  340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395
         ||| ||  | ||| :::| ||:|||||||||||: |||||: | | |:|||     :|:
Db  358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy  396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454
        |||||:||  ||||| :  ||||  |:| ||||||| :|| | |||||| |  :||||
Db  417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476

Qy  455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513
         | |||| | |||||||||||||:||||:||| || ||||||||| |||| |||||| |:
Db  477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy  514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573
        ||||  |:||||||||||||||||||||| ||||||||||:||||||| :|||||| |||
Db  537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy  574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633
        ||||||:||||||||||||||||||| :|||||||:||| |||||   |:|:||| ||||
Db  597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy  634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692
        ||||||||:|:|| : | || |||||: |||:||||||||||||||||||||| ||:|||
Db  656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy  693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752
        |:|||:|| |: ||:|:||||||||:|||||||||||||:||| |:| |||:|||||| |
Db  716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy  753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809
         |:: ::: :|:||| | | | ||||||||: :| :|||    ::: ||:||||| |||||
Db  776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy  810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868
        :||:||||||  ||| :|:|:||||||||||||||||| :|:| ||  |||:||||||||
Db  836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy  869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926
         ||||||   || |||| ||| ||| ||| ||  :::  ||:||:| |: ||  : | :|
Db  896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | |||||||||||||||| ||: ||||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 4 

US-09-972-599A-6
; Sequence 6, Application US/09972599A
; Patent No. US20020077295A1
; GENERAL INFORMATION:
; APPLICANT: STRITTMATTER, STEPHEN M.
; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH
; FILE REFERENCE: C077 CIP US
; CURRENT APPLICATION NUMBER: US/09/972,599A
; CURRENT FILING DATE: 2001-10-06
; PRIOR APPLICATION NUMBER: PCT/US01/01041
; PRIOR FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: 09/758,140
; PRIOR FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: 60/236,378
; PRIOR FILING DATE: 2000-09-29
; PRIOR APPLICATION NUMBER: 60/207,366
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/175,707
; PRIOR FILING DATE: 2000-01-12
; NUMBER OF SEQ ID NOS: 57
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 6
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-972-599A-6

Query Match 75.3%; Score 4403.5; DB 9; Length 1192;
Best Local Similarity 75.9%; Pred. No. 5.9e-214;
Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20;


Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        |||:||| ||||| |||||| ||||||||| |||||| |||||:||::||||||||||||
Db    1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy   61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
        |||||||| ||  ||| |||:|| :| ||||||||||||||| ||||||||: || :   |
Db   59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy  116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
        |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db  119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy  167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225
        || ||||||||||||||||||| ||||  ||| ||||||:|:||||||||||||||||||
Db  179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy  226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285
        |||||  ||||| |||||| |  :|||::| ::|||||: |:|    ::||| |||||||
Db  238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy  286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339
        |||||||  ||| |||::| | :||:||::||:|: || :  ||: || |       |||
Db  298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy  340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395
         ||| ||  | ||| :::| ||:|||||||||||: |||||: | | |:|||     :|:
Db  358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy  396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454
        |||||:||  ||||| :  ||||  |:| ||||||| :|| | |||||| |  :||||
Db  417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476

Qy  455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513
         | |||| | |||||||||||||:||||:||| || ||||||||| |||| |||||| |:
Db  477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy  514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573
        ||||  |:||||||||||||||||||||| ||||||||||:||||||| :|||||| |||
Db  537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy  574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633
        ||||||:||||||||||||||||||| :|||||||:||| |||||   |:|:||| ||||
Db  597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy  634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692
        ||||||||:|:|| : | || |||||: |||:||||||||||||||||||||| ||:|||
Db  656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy  693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752
        |:|||:|| |: ||:|:||||||||:|||||||||||||:||| |:| |||:|||||| |
Db  716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy  753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809
         |:: ::: :|:||| | | | ||||||||: :| :|||    ::: ||:||||| |||||
Db  776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy  810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868
        :||:||||||  ||| :|:|:||||||||||||||||| :|:| ||  |||:||||||||
Db  836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy  869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926
         ||||||   || |||| ||| ||| ||| ||  :::  ||:||:| |: ||  : | :|
Db  896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | |||||||||||||||| ||: ||||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 5 

US-10-060-036-71
; Sequence 71, Application US/10060036
; Publication No. US20030073144A1
; GENERAL INFORMATION:
; APPLICANT: Benson, Darin R.
; APPLICANT: Kalos, Michael D.
; APPLICANT: Lodes, Michael J.
; APPLICANT: Persing, David H.
; APPLICANT: Hepler, William T.
; APPLICANT: Jiang, Yuqiu
; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY
; TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER
; FILE REFERENCE: 210121.566
; CURRENT APPLICATION NUMBER: US/10/060,036
; CURRENT FILING DATE: 2002-01-30
; NUMBER OF SEQ ID NOS: 4560
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 71
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-060-036-71

Query Match 75.3%; Score 4403.5; DB 14; Length 1192;
Best Local Similarity 75.9%; Pred. No. 5.9e-214;
Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20;

Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        |||:||| ||||| |||||| ||||||||| |||||| |||||:||::||||||||||||
Db    1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy   61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
        |||||||| ||  ||| |||:|| :| ||||||||||||||| ||||||||: || :   |
Db   59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy  116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
        |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db  119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy  167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225
        || ||||||||||||||||||| ||||  ||| ||||||:|:||||||||||||||||||
Db  179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy  226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285
        |||||  ||||| |||||| |  :|||::| ::|||||: |:|    ::||| |||||||
Db  238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy  286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339
        |||||||  ||| |||::| | :||:||::||:|: || :  ||: || |       |||
Db  298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy  340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395
         ||| ||  | ||| :::| ||:|||||||||||: |||||: | | |:|||     :|:
Db  358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy  396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454
        |||||:||  ||||| :  ||||  |:| ||||||| :|| | |||||| |  :||||
Db  417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476

Qy  455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513
         | |||| | |||||||||||||:||||:||| || ||||||||| |||| |||||| |:
Db  477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536

Qy  514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573
        ||||  |:||||||||||||||||||||| ||||||||||:||||||| :|||||| |||
Db  537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy  574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633
        ||||||:||||||||||||||||||| :|||||||:||| |||||   |:|:||| ||||
Db  597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy  634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692
        ||||||||:|:|| : | || |||||: |||:||||||||||||||||||||| ||:|||
Db  656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715

Qy  693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752
        |:|||:|| |: ||:|:||||||||:|||||||||||||:||| |:| |||:|||||| |
Db  716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775

Qy  753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809
         |:: ::: :|:||| | | | ||||||||: :| :|||    ::: ||:||||| |||||
Db  776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy  810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868
        :||:||||||  ||| :|:|:||||||||||||||||| :|:| ||  |||:||||||||
Db  836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895

Qy  869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926
         ||||||   || |||| ||| ||| ||| ||  :::  ||:||:| |: ||  : | :|
Db  896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy  927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986
        |||  | |: |||| | | |||||||||||||||| ||: ||||||||||||||||||||
Db  956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106
        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
        |||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 6 
US-09-789-386-2
; Sequence 2, Application US/09789386
; Patent No. US20020010324A1
; GENERAL INFORMATION:
; APPLICANT: MICHALOVICH, DAVID
; APPLICANT: PRINJHA, RABINDER KUMAR
; TITLE OF INVENTION: NOVEL COMPOUNDS
; FILE REFERENCE: GP-30165-C1
; CURRENT APPLICATION NUMBER: US/09/789,386
; CURRENT FILING DATE: 2001-02-21
; PRIOR APPLICATION NUMBER: U.K. 9916898.1
; PRIOR FILING DATE: 1999-07-19
; PRIOR APPLICATION NUMBER: U.K. 9816024.5
; PRIOR FILING DATE: 1998-07-22
; PRIOR APPLICATION NUMBER: US 09/359,208
; PRIOR FILING DATE: 1999-07-22
; NUMBER OF SEQ ID NOS: 6
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 2
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: HOMO SAPIENS
US-09-789-386-2

Query Match 75.2%; Score 4398.5; DB 9; Length 1192;
Best Local Similarity 75.9%; Pred. No. 1e-213;
Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20;



Qy 


1 


MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 

i i i . i i i i i i i i l 1 1 1 1 1 1 1 I 1 1 1 1 I MINI 1 1 1 1 1 : 1 1 : : 1 1 1 1 1 ! 1 11 1 1 11 
| | | : | j | 1 1 1 1 1 1 1 1 M 1 l 1 II M l > i i i i i i i m ii ■ i i • ■ i ii m i i m i i i i 

MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 


60 


Db 


1 


58 


Qy 


61 


PAAGLSAAAVP — PAAAAPLLDFS SDSVPPAPRGPLPAAP PAAPERQP SWERS PAA P 

i i i i i i i i ii mi I I I • I I • I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 It * 1 1 : 1 
I I I I | I 1 1 II 11 1 t 1 1 • 1 1 • 1 1 1 1 1 M 1 1 1 l l l l l l l l l l I I I • ii ■ i 

PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 


115 


Db 


59 


118 


Qy 


116 


APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 

1 1 1 MM II 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II II t II 1 1 1 1 1 1 1 1 I 1 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 M II || II M 1 M 1 1 M 

APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 


166 


Db 


119 


178 


Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

II 1 1 M 1 1 M M II 1 II 1 M 1 1 II 1 III It M 1 1 • 1 • 1 II 1 1 1 1 1 1 1 1 1 1 M If 1
|| 1 1 i t | M | I 1 I 1 11 1 1 1 1 1 1 1 1 1 III 1 1 M 1 1 - 1 - 1 M 1 1 II II I 1 1 1 M 1 M

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237


Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

1 1 1 i 1 Mill 1 1 1 1! 1 1 • 1 1 1 - • t ••Mill* 1*1 - * 1 1 1 1 1 M 1 II
Mill Mill 1 1 1 1 M 1 • 1 M ■ ■ 1 ••Mill* 1 ■ 1 ••III I I M I t I

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297


Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

ii ii i M til i i i . • i i . | 1 . 1 | • • I I • I • II • I I • M 1 III
| | | || M Ml II 1 • • 1 1 - II • 1 1 • • 1 1 • 1 • ll • II* ii i 1 i 1

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357


Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

mi ii i i i i > < • i I i • I I 1 I 1 M 1 1 1 1 • 1 1 1 i 1 * E 1 Mill : 1 :
Ml || | 1 | | . - . 1 M • I 1 1 1 1 1 1 1 1 1 1 • Mill* 1 1 Mill • M

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416


Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

i it i m i i M I I I • l M 1 ME It 1 1 1 1 1 * II II 1 1 II 1 : 1 1 1 1
|| 1 It : 1 1 1 1 1 1 1 • MM 1*1 If 1 1 1 1 1 Ml M M II 1 -MM

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476


Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513

i i ii i i I ll it I M I 1 II M 1 1 1 1 • 1 1 1 II II M 1 1 1 M MM 1 II 1 11 1 :
| | I I I 1 1 II 1 II 1 1 II 1 1 1 * 1 M 1 • 1 1 1 II 1 1 M 1 1 1 M MM 1 1 M M l

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536


Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573

till M ll l l l l ll l l 1 1 1 1 1 1 II 1 II 1 1 M II 1 M M 1 II II M : \ 1 1 1 M Mi
M II 1 ■ M II 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M M 1 M M It M II M ill

Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596


Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

II 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 : 1 1 II 1 1 1 : 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 MM

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655


Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

M 1 1 1 1 1 1 : 1 : 1 1 : 1 11 Mill* 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 = 1 1 1

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715


Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

1 : 1 1 1 : 1 1 1 : 1 1 : 1 : II 1 1 1 1 1 1 : 1 1 II 1 1 1 1 1 1 1 1 1 : 1 1 1 hi 1 M : 1 1 1 1 1 1 1

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775



Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

| : : : : : : I : I I I I I I I I I i I I I : : I : I I I : : = I I : I I I I I I I I I 1 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

:||:|||||| Ml :|:|:IMI t t I I I 1 : I : I :M I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I M I I II 1 I I II I I I I I I I I : : : I I : I I : I I : I I : I : I 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

IN I I : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I II M II I I I I I I I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046 

| | | | | | | | I || I I I I I I I II I I I M I I II I I I M II I I I II I I I I I I I I I I M M I I I I I 
Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I I II I II I I I I I I I I I I II I I M I I I I I I I I I I I I II I I I I I I I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I | || | | | I I I I I I I I : II I I I I I I I I I I : I II I I I I I I I I I I : 

Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 7 

US-09-893-348-23 

; Sequence 23, Application US/09893348
; Patent No. US20020072493A1
; GENERAL INFORMATION:
; APPLICANT: EISENBACH-SCHWARTZ, Michal
; APPLICANT: COHEN, Irun R.
; APPLICANT: BESERMAN, Pierre
; APPLICANT: MOSONEGO, Alon
; APPLICANT: MOALEM, Gila
; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND
; TITLE OF INVENTION: THEIR USES
; FILE REFERENCE: EIS-SCHWARTZ=2A
; CURRENT APPLICATION NUMBER: US/09/893,348
; CURRENT FILING DATE: 2001-06-28
; PRIOR APPLICATION NUMBER: US 09/314,161
; PRIOR FILING DATE: 1999-05-19
; PRIOR APPLICATION NUMBER: US 09/218,277
; PRIOR FILING DATE: 1998-12-22
; PRIOR APPLICATION NUMBER: PCT/US98/14715
; PRIOR FILING DATE: 1998-07-21
; PRIOR APPLICATION NUMBER: IL 124500
; PRIOR FILING DATE: 1998-05-19
; NUMBER OF SEQ ID NOS: 29
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 23
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-893-348-23 



Query Match 75.2%; Score 4398.5; DB 9; Length 1192;

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

| | | : M I I I I 1 I MINI I I I I I I I I I I I I 1 I I I I I I : I I - I I I I I I I M I M I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

MINIM II I M I II : I I : I I I I I I I I M I I I I I 1 I I I I I I I : I I : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml II II I I I I I I I I I I I I I I I I I M I M II II I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178



Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

M MIIIIMMIIII I I I I I : I : I I ; ; , M I I I I I I I I I M 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill llllll I :MI::I ::MMI: hi :MM MMMI 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

MMMI I M I I I : : I I : I I = I I = : M M : I I : I I : I I I I I I 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

| | | | | | M I : : : I I I : I I I I M I I I I I : M II I : I I I : I I I = I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

Mllhll Mill: I I M I : I II I I I I I : M I I II I I I : I I I I 
Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I I I I I I I I : I M I : I I I II I I I I II I M I II I II : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM I : I I M I II I I I I I I I I I I I M I M I II I II I I : I I II I I I MMMI III 

Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

|| I I I I : II I I II I I I I I I I I I I I I I : I I I M II : I II II II I I : I : I M I I II 
Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

| M II II I : I : I I : I I I I I II I : I I I : I I I I I I I I M II I II I I I I I I I I : I I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

| M I I : M I : I I : I : I II I I I I I : I I I I I M I I I I II : II I Ml II I : I I I M I I 



Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775



Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

| : : : : : : I : I I I I I I I I I I I I I : : I MM ■ M : I M M Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: | | : | | | | | | I I I M M M M M M M M M M M MM I I M I M I M M M 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I M M M M I M I I M M : : : M M I M I : M : I M 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

Ml | | : I I I I I I M I I I I M II M M I II I I : I I I M I M M I I I M I I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046 

I I I I I II II II M I I I I M I I I I I I II M I I I M II I I I I M I I M I I I I I I 

Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

| | | | | | | || I I I I I I I II I I I I I I I I M I I M I I I I 1 I I I I M I I I I I I I M I I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I I I I I I I I I M I I I I I M I I I I I I I li M M I I I I II I I I II I I I M M : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 8 

US-10-267-502-429 

; Sequence 429, Application US/10267502 

; Publication No. US20040071700A1 

; GENERAL INFORMATION: 

; APPLICANT: Kim, Jaeseob 

; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 
; FILE REFERENCE: LSD-07416 

; CURRENT APPLICATION NUMBER: US/10/267,502

; CURRENT FILING DATE: 2003-01-27 

; NUMBER OF SEQ ID NOS : 439 

; SOFTWARE: PatentIn version 3.2

; SEQ ID NO 429 

; LENGTH: 1192 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-267-502-429 

Query Match 75.2%; Score 4398.5; DB 12; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 2 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

M I M I I III II I II II I MUM I I I II M I : M I I I I I I I I M I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

| | | | | | M II III I I I : I I M I I I I I I I M I I I I I I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

| | I II I I I I I I I I M I I I I II I I I I I I MM I I I I I I I I II I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

|| M || || | | | I I || I II I II I I I I IN lllll|:|:IIIMIIMIIIIIIIII 
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

| | | || || I I I I I : I I I : : I : : I I I II : I : I : : I I 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

Mlllll II I I I I : : I I : II : I I 5 : I I : I : II : 11= M I Ml 
Db 298 SEMGSSFSVSPKAESAVTVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

| | | M | | | | ::: | I I : I I I I II I M M : I I I II : I I I M II M : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

IMIIMI Mill : MM Ml IMMII : I I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513

| | | | | | | | I I I I I I I I I II M I I I : II I II I I II I M I I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573

| | | | |: I 1 I I 1 I I I I I I MM M I I I II I It : I I I I I I I Mlllll Ml 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I M II I I II I II I I I I I I M I MINIM: I 1 Mill IMMII MM 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

| | | | | | | | M M I : I II I I I I I: I I I M I II M II I I II M II I I I M II M I I 
Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

| : | M M I M II M M I I I I II I M I I I I I I II II II M I I Ml I II M I II I I I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

|:: ::: MMM I I MINIM: : I MM ::: MMMM Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: | | M I I M I II I M : I M M II I II I II M II N MM II I II M I II M II 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 



Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

| | | | | | M I I I I I I I I M I I I M ::: l|:M:| I: II : I :l

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

HI | | : | | | | | | || I I I I I I I I I II II M II: M I I I M I I I I I I I I I I I I

Db 956

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

I I II I I I I I I I I I I I II I II I I I I I M I I I I I M II I I

Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

| | | | | M | | I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I M I I II I I 1 I I I

Db 1076 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

I I M I I MII:MIIIIM I I I I II I I I I I : I I I I I I I M I I I I M I I I I

Db 1136



RESULT 9 
US-10-327-213-9 

; Sequence 9, Application US/10327213
; Publication No. US20040121341A1
; GENERAL INFORMATION:
; APPLICANT: FILBIN, MARIE T.
; APPLICANT: DOMENICONI, MARCO
; APPLICANT: CAO, ZIXUAN
; TITLE OF INVENTION: INHIBITORS OF MYELIN-ASSOCIATED GLYCOPROTEIN (MAG)
; TITLE OF INVENTION: ACTIVITY FOR REGULATING NEURAL GROWTH AND REGENERATION
; FILE REFERENCE: CUNY/003
; CURRENT APPLICATION NUMBER: US/10/327,213
; CURRENT FILING DATE: 2002-12-20
; NUMBER OF SEQ ID NOS: 43
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 9
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-327-213-9 

Query Match 75.2%; Score 4398.5; DB 16; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

|||:1|| Mill IIIMI IIIIM I I I I I t I I = 1 I ' = t I I I I I I I I I I * 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

IMM II 111 = 11 =1 I I I 1 I I I I I I I I I I IN: M ^ I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml I I M I I M I I I I II M >l I I II I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178



Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

|| | M | II I I I II II II 111111:1:11 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285

Mill | | | I I MINI I :MI::i ::IMM: hi ::IM IMIIM 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

| | | | | | | | | | | | I : : I I : I I : I I : : I I : I : I I : I I : I I I I I I 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKKKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

| | | | | | | | | ::: I I I : M I I I I II I I I : I I I II : I I I : I M : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

Mllhll Mill : I I I I hi IMIIM :M IMIIM Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I III IMIMMIMM M MINIM MINI h 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I || | | M I II M II I II M II I M II I II I I M I II I M II II I I M I II I I III 

Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

| M I II M I M I II I I M I I 1 I I I M M II I II I : I II Mill I : I M I I MM 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

MIMIMMMI : I - I I I : I M M I I I II I I Ml MMM 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

I : I I | : I I | : I I : I : II II II I I M M I I I II M I M : II I I M I M M I M M I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

| : : : : : : | : I II I I I I I I II II : : I MM : : : M M I I I I Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: I | M I I I M III M M M II II I M II I II I I I I MM II II I M I II I I II 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

MMM II M Ml III M ::: IMIIM M II : I M 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

: MM I I I M I I II M II I II I M II: Ml I M 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 



Qy 987 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046 

| | | | I I I I I I I I I I I i I I I I I I I i I II I I I II I I II II I I 

Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 Mini 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

| | I | I i I I I I I I I I I : I M I 1 II I I I M M I I II I M I I I I I II II I I I M I I M : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 

RESULT 10 
US-10-466-258-9 

; Sequence 9, Application US/10466258
; Publication No. US20040132096A1
; GENERAL INFORMATION:
; APPLICANT: GLAXO GROUP LIMITED
; TITLE OF INVENTION: ASSAY
; FILE REFERENCE: P80966 GCW
; CURRENT APPLICATION NUMBER: US/10/466,258
; CURRENT FILING DATE: 2003-07-15
; NUMBER OF SEQ ID NOS: 13
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 9
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-466-258-9 

Query Match 75.2%; Score 4398.5; DB 16; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 1e-213;

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

| | | : | | | I M II I I M M II I I M II M II :M I I I I I I I I I M I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 



Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

| | | I I I II II I : I I : I I II I I I II I M II : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml I I I I II II I II I II I I I I M I I II MM I I I I M I II II I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178



Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

|| MMM II MM Ml I M I II = I = i i I I I I I I I M M I M II 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill MMM I M I M M : : I II II : Ml : : I I I II II M I 
Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

Mlllll Ml MMM I MMIMMMM II : IM II I Ml 



Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

| | | | | | | | | : : : | I I : M I I II > I I M : I I I I I : I I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

|||||:|| : I I I I I : I I : I I I I I I I I I = I I M 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

| M I I I I 1 I I I I I I I M I I : M I I : I I I I I I I M I I I I I I I I I I I I I I I I : 
Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

| M I I : I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I : M I II I I : I I I I I I I I I 
Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

| M I I I : I I I M I I I I I I I I :MIIIII:lil Mill hlUII MM 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

||||||||:|:|| : I I : I II : I I I I I I I I M I II II I I I I I I M M I I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

| : | | | : | | | : | I : I : I II I I M I : I II I I I M M I I I M II Ml II M I I II M I 
Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809

|:: ::: :|:||| I I I I 1 I I I I I : : I MM ::: MM II II Mill 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868 

: | | : | | || | | III : I : I : I II II II I I I II II I II MM II I I I M I I I I I M 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

| | | | | I I I II II I II I M I I I II : : : I I M I M I : I I : I M 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

Ml | | : | || I I I II II II II I I I M M II M : I I I I I M I I I I I I I M I I I 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

| | | | || || | | II I I II I I II I I I M II I I I M II I I I M M I I I I II I I I I I I I I I I I I I 

Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I || || | | || II I I I II I I II I II M I I I I I M M II I II I I II I M I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

| | | | I I I II II I I I I M I II I I I I II I M I I I M I M I I I I I I II M I I II II M : 
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 11 
US-10-408-967-7 

; Sequence 7, Application US/10408967
; Publication No. US20040063161A1
; GENERAL INFORMATION:
; APPLICANT: Pharmacia & Upjohn Company
; APPLICANT: Yan, Riqiang
; APPLICANT: Lu, Yifeng
; TITLE OF INVENTION: Compositions and Methods of Treating Alzheimer's Disease
; FILE REFERENCE: 00925
; CURRENT APPLICATION NUMBER: US/10/408,967
; CURRENT FILING DATE: 2003-04-08
; NUMBER OF SEQ ID NOS: 9
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 7
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-408-967-7 

Query Match 75.1%; Score 4389.5; DB 12; Length 1192; 

Best Local Similarity 75.8%; Pred. No. 3e-213; 

Matches 907; Conservative 104; Mismatches 147; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

| | |: | | | I I I I I M I I I I I II I I M llll|:||::IIIIMIIMIII 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

| || M | M M M I I I I : M : I I I I I I I I I M I I I I II I I I I I I : I I : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml | | | | || I I I I I I I II II i I I I M I II M I I I I I II I I I I II 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178



Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

|| MINIMI MM MM Ml II I I I I M M II II II M I II M II M 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill llllll I :lll::| :MIIII: 1:1 :MM MMMI 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I II II : : I I : I I M I : M I M : I I : II : M I I I I 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

|M M I Ml : : : I I I : II II II I II II : Mill: I I Mill : I : 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTT 454 

| | || I : M Mill : MM M 11=1 II Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 



Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

| MM 1 M I I I 1:1 I I I :l I I I I MMMMI MM I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM M M M M M M M M M M M I I M M M M i : I M I I M : M M M Ml 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

M M M : M M M M M M M M M I M M M M : M I MM! I : I : I I I MM 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

| | | I I I I I : I : I I : I M I I I I I : M M M M M M M M M M M M I I M M I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

| : \\ | : M I : I I : I : I I I I I I I h I I I I I I I I I I I I I H I I Ml M M II II II I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

| : : :: : : I : I I I I I I I I I I I II : : I MM : : : I M I M M 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLINTKDTLLPDEVSTLSKKEKIPLQMEEL 835 

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : i M I I I III M M M II II M M I I I II II I MM II IMMIIIIIII 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926 

MUM II I I I I M I II I II I I I : : : I I M I M I : I I : I M 
Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

Ml I I : I II I I I II I I I I I II I M M II I M : I II I II II II I I I I II I II 
Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

M I I I II I I I II I I I I M M I II M II I I M I I I II II M I I I I 1 M M I I 

Db 1016 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFPA 1075 

Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

I M M I I I I I I II I I I I I II II I II II II II II I I II I I I M I M II M II I 

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

II II II I I I I I I I II M II I I I I I I I I I I I = ! i I I I 1 I 1 I I I t I I I I I I t = 

Db 1136 NGLTLLIIALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 12 
US-10-205-194-164 

; Sequence 164, Application US/10205194 

; Publication No. US20030134301A1 

; GENERAL INFORMATION: 

; APPLICANT: Warner-Lambert Company 

; APPLICANT: Lee, Kevin 



; APPLICANT: Dixon, Alistair
; APPLICANT: Brooksbank, Robert
; APPLICANT: Pinnock, Robert
; TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain
; FILE REFERENCE: WL-A-018201
; CURRENT APPLICATION NUMBER: US/10/205,194
; CURRENT FILING DATE: 2002-07-24
; PRIOR APPLICATION NUMBER: GB 0118354.0
; PRIOR FILING DATE: 2001-07-27
; NUMBER OF SEQ ID NOS: 177
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 164
; LENGTH: 379
; TYPE: PRT
; ORGANISM: Rattus norvegicus
; FEATURE:
; OTHER INFORMATION: Foocen-m2 reticulon
US-10-205-194-164 

Query Match 25.9%; Score 1513; DB 14; Length 379; 

Best Local Similarity 32.6%; Pred. No. 1.4e-68; 

Matches 379; Conservative 0; Mismatches 0; Indels 784; Gaps 1; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

| 1 I I I I M I M I I I I I ! I I I I I 1 I I I I I I I I M I Mill 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

| | | | | M | I II I I I 1 I I I I I I I M I I II I I I I I I I I I I I I I I I I I I M I I I I I II I I I M 
Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

| | | | | | | | | I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I M I II I I I I I I I M I M I 
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

I I I I I I I I I I I 

Db 181 AASEPVIPSSA 191 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 



Db 192 



191 



Qy 301 AI LVENTKEEVI VRS KDKEDLVCSAALHS PQES PVGKEDRWS PEKTMDI FNEMQMS WA 360 

Db 192 191 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVIJiARANv^SKVDRKCLEDSLEQKSLGKDSEGR 420 



Db 192 



191 



Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 



Db 192 



191 



Qy 



481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 



Qy 541 NEATGTKIAYETKVDLVQTSE^IQESLYPTAQLCPSFEE^VEATPSPVLPDIVMEAPLNSL 600 

Db 192 191 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 192 191 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Db 192 191 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 192 191 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 192 191 

Q y 841 UDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 192 191 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 192 191 

q v 961 RS L SAVL SAELS KT S WDLLYWRDI KKT GWFGAS LFLLLS LTVFS I VS VTAYI ALALLS 1020 

| | | | | | I I I I I I I I II I I 1 t I I I I M I I I I I I I I I I I I I I M I M 
Db 192 VVD L L YWRD I KKT GVVFGAS L FLL L S LTVFS I VS VTAYI ALALLS 236 

Qy 1021 VT I S FRI YKGVI QAI QKS DEGHP FRAYLE S EVAI S EELVQKYSNSALGHVNST I KELRRL 1080 

|| I I I II I I I I II I I MINIM I I II I I I I I I I 

Db 237 VT I S FRI YKGVI QAI QKS DEGHP FRAYLE S EVAI S EELVQKYSNSALGHVNST I KELRRL 296 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

| | | | | | | | | | II II I I I I I I M M I I I M I I I I I I I I I M M 

Db 2 97 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 356 

Qy 1141 NKSVKDAMAKIQAKI PGLKRKAD 1163 

It I I II I II I II I I M I I I I II I 
Db 357 NKSVKDAMAKIQAKI PGLKRKAD 379 

RESULT 13 
US-09-893-348-20 

Sequence 20, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ, Michal
APPLICANT: COHEN, Irun R.
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 



; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND
THEIR USES

FILE REFERENCE: EIS-SCHWARTZ=2A
CURRENT APPLICATION NUMBER: US/09/893,348
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: PatentIn version 3.1
SEQ ID NO 20 
LENGTH: 360 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-20 

Query Match 24.1%; Score 1411.5; DB 9; Length 360; 

Best Local Similarity 31.0%; Pred. No. 1.7e-63; 

Matches 360; Conservative 0; Mismatches 0; Indels 803; Gaps 1; 
Q V i MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I 1 | M I I I I I I I I M I 1 I I I I I I I I M M 1 I ! I I I I I I I I t I I I I I I I I I I I I 

Db i MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qv 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

| | | | | | | | | | | I M I I I I I I I I I I I I I I I II 1 I I I I I M I I I I I I M I M I I I I I 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

| | | | Mill I I II I I I I I I I I I I M I I I I II I I I I I I I I I 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSV 172 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 173 "~ 172 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

172 

Db 173 

Qy 301 AI LVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

_ 172 

Db 173 1/Z 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVTiAARANVESKW 420 

Db 173 ■ 172 

Qy 421 NEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 



Db 



173 



172 



Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

172 

Db 173 17 

Qy 541 NEATGTK IAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 173 172 

Q y 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEE^MNVALKALGTKEGIKEPESF 660 

172 

Db 173 1 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

1 72 

Db 173 1 /Z 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 173 172 

Qy 781 NLHSTKDAASND1PTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 173 172 

Qy 841 UDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 173 172 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 173 172 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

I I I I 1 I I I I I M I I I I I I I I I M I I I I I I I I M I I I I I I I 

Db 173 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 217 

QY 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHWSTIKELRRL 1080 

| | | | | | M II I I I I I I I I M I I I M I M M I I I II I I I II M I I I M 

Db 218 VT I S FRI YKGVI QAI QKS DEGHP FRAYLES EVAI S EELVQKYSN S ALGHVN ST I KELRRL 277 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

Mill II I I I I I I I I I I I I I I I M I II I I I I I M I 1 I I I I II M I I 

Db 27 8 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 337 

Qy 1141 NKSVKDAMAKI QAKI PGLKRKAD 1163 

I II I II I II I M I I I I I I I I I I I 
Db 338 NKSVKDAMAKI QAKI PGLKRKAD 360 



RESULT 14 
US-09-789-386-6 

; Sequence 6, Application US/09789386 
Patent No. US20020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 
APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 



FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/09/789,386 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE : FastSEQ for Windows Version 3.0 
SEQ ID NO 6 
LENGTH: 373 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 

Query Match 20.4%; Score 1191; DB 9; Length 373; 

Best Local Similarity 27.8%; Pred. No. 2.5e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

| | | : | | | | | I I I I I II I I I I I I I II I I I I I II : I I :: M I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 5 8 

Qy 61 PAAGLSAAAVP — PAAAAPLLDFS S DSVPPAPRGPLPAAPPAAPERQP SWERS PAA P 115 

MINIM II III MI:M :l I I I I I I I I I I I I I I MINIM: || : I 
Db 59 p AAGL S AAP VP T AP AAGAP LMD FGN D FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP 118 



Qy 



116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Ml II I I II I I I N M I I II I I I I I I I UN I N I I I I I I I M I 

Db lig ApsPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 17 8 



Qy 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 6 

I II 

Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 2 86 

Db 183 182 

Qy 287 EMGSSFKGSPKGESAI LVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED 4 06 

Db 183 182 



Q y 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTS 4 66 

Db 183 " 182 

Q y 4 67 ENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526 



Db 



Qy 527 LTPDLVQEACESELNEATGTKIAYETKVPLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586 

Db 183 182 

Qy 587 VLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 646 

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 

Db 183 182 

Qy 707 EHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 

Db 183 182 

Qy 767 PQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDK 826 

Db 183 182 

Qy 827 IKESETFSDSSPIEIIDEFPTFVS7VKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 



Db 



Db 183 



182 



0v 887 ELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTK 946 

Y II: 

Db 183 " GSV 185 

Qy 947 



1006 



EAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFS 

I I I I I I I I I II I I I I I I I I I I I I M I I M I I 
186 WDLLYWRDIKKTGWFGASLFLLLSLTVFS 216 



Qy 1007 I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGHP FRAYLES EVAI SEELVQKYSN S A 1066 

| | | | | | M I I II I I I II I II I I I M I I I M M I II I I I II I I I II N 

Db 217 I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGHP FRAYLES EVAI S EELVQKYSN S A 276 

Qv 1067 LGHVNSTI KELRRLFLVDDLVDS LKFAVLMWVFT YVGALFNGLTLLI LALI SLFS I PVI Y 1126 

Mill I I I M I II II I I I I I I I I I ! i i I I I M ! I ! U ! ! I M I I I I I : I I I I 

Db 277 LGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIY 336 

Qy 1127 ERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I 1 1 I I I I :IIIIMIIMMIMIIII: 

Db 337 ERHQAQIDHYLGIANKNVKDAMAKIQAKI PGLKRKAE 373 

RESULT 15 
US-09-765-205-6 

Sequence 6, Application US/09765205 
Patent No. US20020034800A1 
GENERAL INFORMATION: 
APPLICANT: Cao, Li 

TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 
FILE REFERENCE: 1458.004/200130.449
CURRENT APPLICATION NUMBER: US/09/765,205
CURRENT FILING DATE: 2001-01-17
PRIOR APPLICATION NUMBER: US/09/212,440
PRIOR FILING DATE: 1998-12-16 



NUMBER OF SEQ ID NOS : 46 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 6 
LENGTH: 373 
TYPE: PRT 
ORGANISM: human 
US-09-765-205-6 

Query Match 20.4%; Score 1191; DB 9; Length 373; 

Best Local Similarity 27.8%; Pred. No. 2.5e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7; 

Q V i MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

|||:|M INN M I I I M I I I t I I I I , I I 1:1 I ' I I Ml ! II I 1 

Db i MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 p AAGL S AAAVP — P AAAAP LL D FS S D S VP PAP RGP L PAAP P AAP ERQ P S WE RS P AA P 115 

| | | | | | M I I I I I I I I : M : I II I I I I I I I M I I I I I I! I I I I : M : I 
Db 59 P AAGL S AAP VPTAPAAGAPLMDFGNDFVP PAPRGPLPAAP PVAPERQPSWDPSPVSSTVP 118 



Qy 



116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAP P ST PAAPKR 166 

Ml M I I I I I M I II I I II I M I I II I II M I I I I I I I I II I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 



Qy 167 RGS GSVDETLFALPAAS EPVI P S S AEKIMDLMEQP GNTVS S GQEDFP S VLLETAAS LP S L 226 

111 182 

Db 179 RGSS 1DZ 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGS S FKGSPKGESAI LVENTKEEVI VRS KDKEDLVCSAALHS PQES PVGKEDRWS PEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSWAPVREEYT^FKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED 4 06 

Db 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTS 466 

Db 183 182 

Qy 467 ENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526 

Db 183 182 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586 

Db 183 182 

Qy 587 VLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 64 6 

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 7 06 



Qy 707 EHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 

Db 183 182 

Qy 767 PQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDK 826 

Db 183 182 

Qy 827 IKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 



Db 183 



182 



0v 887 ELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTK 946 

Y I i : 

GSV 185 



Db 183 



1006 



Ov 947 EAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFS 
Y I I I I I I I II I I I I I I I I I I I I I I I M M I I I 

Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVFS 216 

Ov 1007 I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGHP FRAYLES EVAI S EELVQKYSN S A 1066 

| M | | | | | M I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I M I I I 

Db 217 I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGH P FRAYLES EVAI S EELVQKYSN S A 276 

Qy 1067 LGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLIIAlLISLFSIPVIY 1126 

Mill I I I I I I I I I I I I I I MIIIMIIIIIMMIMIMMIMIhllM 

Db 277 LGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIY 336 

Qy 1127 ERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

| I I I | | | I I I I I I I I : I I I I I I I I I I I I I I I M I I : 
Db 337 ERHQAQIDHYLGLANKNVKDAKAKIQAKI PGLKRKAE 373 



Search completed: September 29, 2004, 18:48:11 
Job time : 131.84 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         September 29, 2004, 18:06:43 ; Search time 91.4948 Seconds
                                               (without alignments)
                                               4010.587 Million cell updates/sec

Title:          US-09-830-972-2
Perfect score:  5848
Sequence:       1 MEDIDQSSLVSSSTDSPPRP......VKDAMAKIQAKIPGLKRKAD 1163

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1017041 seqs, 315518202 residues



Total number of hits satisfying chosen parameters:      1017041

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      SPTREMBL_25:*
                1:  sp_archea:*
                2:  sp_bacteria:*
                3:  sp_fungi:*
                4:  sp_human:*
                5:  sp_invertebrate:*
                6:  sp_mammal:*
                7:  sp_mhc:*
                8:  sp_organelle:*
                9:  sp_phage:*
                10: sp_plant:*
                11: sp_rodent:*
                12: sp_virus:*
                13: sp_vertebrate:*
                14: sp_unclassified:*
                15: sp_rvirus:*
                16: sp_bacteriap:*
                17: sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result           Query
   No.    Score  Match  Length  DB  ID       Description

     1   5312.5   90.8    1162  11  Q8BGM9   Q8bgm9 mus musculu
     2   5307     90.7    1163  11  Q8K3G8   Q8k3g8 mus musculu
     3   4501.5   77.0    1046  11  Q8BGK7   Q8bgk7 mus musculu
     4   3627.5   62.0     986   4  Q8IUA4   Q8iua4 homo sapien
     5   3299.5   56.4     720  11  Q7TNB7   Q7tnb7 mus musculu
     6   2926     50.0     639  11  Q8K290   Q8k290 mus musculu
     7   2610     44.6     578  11  Q80W95   Q80w95 mus musculu
     8   1416     24.2     375  11  Q8BHF5   Q8bhf5 mus musculu
     9   1314.5   22.5     356  11  Q8BH78   Q8bh78 mus musculu
    10   1304     22.3     357  11  Q8K3G7   Q8k3g7 mus musculu
    11   1283.5   21.9     392   4  Q96B16   Q96b16 homo sapien
    12    878     15.0     184   6  Q7YRW9   Q7yrw9 bos taurus
    13    876     15.0     199  13  Q7T224   Q7t224 gallus gall
    14    844     14.4     179   6  Q9GM33   Q9gm33 macaca fasc
    15    792     13.5     780  11  Q8K4S4   Q8k4s4 mus musculu
    16    788     13.5     780  11  Q8K0T0   Q8k0t0 mus musculu
    17    762.5   13.0     760  13  Q90638   Q90638 gallus gall
    18    737     12.6     214  13  Q7T222   Q7t222 carassius a
    19    700     12.0     643  11  Q8CCU2   Q8ccu2 mus musculu
    20    685     11.7     199   4  Q9BQ59   Q9bq59 homo sapien
    21    671     11.5     267  11  Q63765   Q63765 rattus sp.
    22    669     11.4     208  13  Q90637   Q90637 gallus gall
    23    625.5   10.7     236  11  Q8VBU0   Q8vbu0 rattus norv
    24    625.5   10.7     237  11  Q8C6D5   Q8c6d5 mus musculu
    25    590.5   10.1     221  13  Q7ZUD6   Q7zud6 brachydanio
    26    586     10.0     595   5  Q9VMV9   Q9vmv9 drosophila
    27    532      9.1     224   5  Q9VMW1   Q9vmw1 drosophila
    28    520      8.9     202   5  Q9VMW2   Q9vmw2 drosophila
    29    520      8.9     222   5  Q9VMW4   Q9vmw4 drosophila
    30    520      8.9     234   5  Q9VMW3   Q9vmw3 drosophila
    31    517      8.8    2484   5  Q9U347   Q9u347 caenorhabdi
    32    503.5    8.6    2607   5  Q23187   Q23187 caenorhabdi
    33    377      6.4    2768   5  Q9VC00   Q9vc00 drosophila
    34    347.5    5.9   10578   5  Q8ISF5   Q8isf5 caenorhabdi
    35    344      5.9     107  13  Q7T223   Q7t223 carassius a
    36    343.5    5.9   18519   5  Q8ISF6   Q8isf6 caenorhabdi
    37    343.5    5.9   18534   5  Q8ISF7   Q8isf7 caenorhabdi
    38    342.5    5.9    1417   3  Q871Y7   Q871y7 neurospora
    39    335      5.7    5412   5  Q9W596   Q9w596 drosophila
    40    331      5.7    4900   5  Q9N541   Q9n541 caenorhabdi
    41    328.5    5.6    5327   5  O76891   O76891 drosophila
    42    322      5.5    7962   4  Q10465   Q10465 homo sapien
    43    320      5.5     222   5  Q23188   Q23188 caenorhabdi
    44    317.5    5.4   17352   5  Q95YM2   Q95ym2 procambarus
    45    313.5    5.4    1444   5  Q9VTN2   Q9vtn2 drosophila


ALIGNMENTS 



RESULT 1 
Q8BGM9 

ID   Q8BGM9     PRELIMINARY;      PRT;  1162 AA.
AC   Q8BGM9;
DT   01-MAR-2003 (TrEMBLrel. 23, Created)
DT   01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE   RTN4.
GN   RTN4.
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129/SvcJ7, and 129SvcJ7;
RA   Oertle T., van der Putten H., Schwab M.E.;
RT   "Genomic Structure and Functional Characterization of the Promoter
RT   Structures of Human and Mouse Nogo/Rtn-4.";
RL   Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN   [2]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129/SvcJ7, and 129SvcJ7;
RA   Oertle T., Schwab M.E.;
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129/SvcJ7;
RA   Van der Putten H.;
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN   [4]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129SvcJ7;
RA   Van der Putten H., Mir A.;
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR   EMBL; AY102284; AAM73506.1; -.
DR   EMBL; AY102286; AAM73511.1; -.
DR   MGD; MGI:1915835; Rtn4.
DR   GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR   GO; GO:0007399; P:neurogenesis; IDA.
DR   InterPro; IPR003388; Reticulon.
DR   Pfam; PF02453; Reticulon; 1.
DR   PROSITE; PS50845; RETICULON; 1.
SQ   SEQUENCE   1162 AA;  126613 MW;  855697FBEE11781F CRC64;

Query Match 90.8%; Score 5312.5; DB 11; Length 1162; 

Best Local Similarity 91.0%; Pred. No. 9.1e-259; 

Matches 1064; Conservative 37; Mismatches 55; Indels 13; Gaps 6 
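
The Best Local Similarity figure appears to be the number of identical positions divided by the total number of alignment columns (matches + conservative + mismatches + indels). This is inferred from the statistics printed in this listing rather than from GenCore documentation, and the function name is hypothetical. A minimal Python sketch under that assumption:

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all alignment columns, to one decimal.

    Assumption: alignment length is the sum of the four printed counts.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts printed for Q8BGM9 (this result) and for US-10-205-194-164.
print(best_local_similarity(1064, 37, 55, 13))
print(best_local_similarity(379, 0, 0, 784))
```

Because indels count as columns, a short database sequence aligned end to end against the 1163-residue query scores low on this measure even when every aligned residue is identical.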

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              I I I I I I II I I I I I II I 1 I I I II I I I I I I I I I I I I I I I I = I I I I I I I I I M I II I I I I I
Db          1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
              I I I I I I I I III I I M I I I I M I I I I I I M I II I I I I I II I I I I I I I I I I I I Mill
Db         60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
              I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II
Db        119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy        181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
              I I I I I I I M II I I M I I II I I I I I I II I I M M II I I 1 I II I I I I I I I I I I I I I I I I I
Db        177

Qy        241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
              I I I I I I : I : I I I I I I I I I I I I : II I I I I I I I I I M : I I I I I II I I I I I I I II I II I I
Db        237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296



Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESP VGKEDRWS PEKTMDI FNEMQ 355 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I Ml I : I I I I I I I I I I M : 

Db 2 97 7\MLVENTKEEVIVRSKDKEDLVCSAALHNPQES PAT LTKWKEDGVMS PEKTMDI FN EMK 356 

Qy 356 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415 

I I I I | I I I I I I I I I I I I I I I I I I I II II II I II I I I I I I : I I I I I : I I I I I I I I I M 
Db 357 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGK 416 

Qy 416 DSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTTANTFPLLEDHTSENKTDEKKI 475 

Ml llhllll III III I I I I I I I 11:111111 II I I : I I I I I I I I I M I I I I 
Db 417 DSESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKI 476 

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535 

II I I I I I I I II I I I I I I : I I I I I I I I I I II I I I I I I : I I I II I I I I I I I I 

Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEA 536 

Qy 536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I : I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCPSFEEAEATPSPVLPDIVMEA 596 

Qy 596 PLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655 

I I I I I I I I I I I I Ml I I I I I I I I I I M I I I I I I M I I I I I : I I I I : I I M 
Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIK 656 

Qy 656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715 

I I I I I II I II I I I I I I I I I I I I I II I I I I I I I I : I I I I I I I I I I I I I I I : I N ' : I i 
Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 774 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I III : I I I I I I M I : I II I 
Db 717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776 

Qy 775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834 

M I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I : I II I I M 
Db 777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836 

Qy 835 DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 894 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I : I I I I : I I I I I I I I I I I I 
Db 837 DSSPIEIIDEFPTFVSAKDDSP KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 893 

Qy 8 95 KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 954 

I I I I I I I I I I I I I I : : I I I I II : I I I I I I I I I I : I I I I I I I I I I : I I I I 
Db 894 KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPS 953 

Qy 955 DTEKEDRSLSAVLSAELS KT S WDLLYWRDI KKTGWFGAS LFLLLSLTVFS I VSVTAYI 1014 

I I I I I I I I I : I I I I I I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M II I 
Db 954 DTEKEDRSLTAVLSAELNKTSVVDLLYWRDI KKTGWFGAS LFLLLSLTVFS I VSVTAYI 1013 

Qy 1015 ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 1074 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I M I I 
Db 1014 ALALLSWISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 1073 



Qy 1075 KELRRLFLVDDLVDS LKFAVLMWVFT YVGAL FNGLTLLI LAL I S L FS I PVI YERHQVQ I D 1134 

I | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II M I I I I I I I I I III 
Db 1074 KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQID 1133 

Qy 1135 HYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

I I I t I I I I I I I I I II I I 1 I i I I I I I I I I * 
Db 1134 HYLGLANKSVKDAMAKIQAKIPGLKRKAE 1162 



RESULT 2 
Q8K3G8 
ID   Q8K3G8     PRELIMINARY;      PRT;  1163 AA.
AC   Q8K3G8;
DT   01-OCT-2002 (TrEMBLrel. 22, Created)
DT   01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE   Nogo-A.
GN   RTN4.
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=BALB/c;
RA   Jin W., Long M., Li R., Ju G.;
RT   "Cloning and expression of the mouse Nogo-A protein.";
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR   EMBL; AY114152; AAM77068.1; -.
DR   MGD; MGI:1915835; Rtn4.
DR   GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR   GO; GO:0007399; P:neurogenesis; IDA.
DR   InterPro; IPR003388; Reticulon.
DR   Pfam; PF02453; Reticulon; 1.
DR   PROSITE; PS50845; RETICULON; 1.
SQ   SEQUENCE   1163 AA;  126691 MW;  6B5F362799417EA4 CRC64;



Query Match 90.7%; Score 5307; DB 11; Length 1163;

Best Local Similarity 91.1%; Pred. No. 1.7e-258;

Matches 1066; Conservative 34; Mismatches 56; Indels 14; Gaps 7;



Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              I I I I I I I I I I I I I II I I II I I I I I II I I I I I M I I I I I : I I I M I I I I I I II I I I I I I
Db          1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
              I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I Mill
Db         60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
              I I I I II II I I I II I II I I I II I I II I I I I I I I I I I M II I I I II I I I I I II I M I I I
Db        119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy        181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
              M M I I I I I II I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I M
Db        177 [sequence line missing in source] 236

Qy        241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
              I I I I I I : I : I I I I I I I I I I I I : I I I I I I 1 I I I I I I : MM M M M M M M M M I
Db        237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296

Qy        301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESP-----VGKEDRVVSPEKTMDIFNEMQ 355
              M M M M M M M M M M M M M M : M M I I Ml M I I I I II I I II h
Db        297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMK 356

Qy        356 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415
              I M M M M M M M M M M M M M M M M M I I M : M M I : I I M I M
Db        357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKSHGK 416

Qy        416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475
              Ml I I I : I II I I I I I III I I M I I I II I I I II II II I I : I II II I I I II I I I I I
Db        417 DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTAANIFPVLEDHTSENKTDEKKI 476

Qy        476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535
              II II I I I M I I M II I I I I I I I I : II M II I I I I II I I I I I I : II I II I I I I I I II
Db        477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAVVATMPEGLTPDLVQEA 536

Qy        536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595
              I II I M I II I I I I II I M I I I I I I I : I II I I M I I II II I II M I I I I I I I I I I
Db        537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596

Qy        596 PLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655
              M I II I II I I I I III MM I I I I I I I I I I II I I M I I I I I : II I I II II
Db        597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 656

Qy        656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715
              I I I I M II M II II I I I II II I M I I I II I I I I I II I I I I I M I I I II : I IIMII
Db        657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 716

Qy        716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 774
              II I I I I I II II M I I M I II I II I I II I I M M I I I I M I III : I I I I I I II h I I M
Db        717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776

Qy        775 LESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFS 834
              I I M I I I II II I I I II : II I II I I I I II I II I II I I I I M I I I I I I I I I I : I M I I II
Db        777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836

Qy        835 DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 894
              I I II I II II I I II I I I I I II II I M M I I II : M I I I I : I I I I : M I I I I I I I I II
Db        837 DSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 893

Qy        895 KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 954
              II Mill I II I I M :: I M I I I : I I I II I I II I : II I I I II I II : I I I I
Db        894 KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPS 953

Qy        955 DTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAY 1013
              I I I I M II I : I II I II I : I M I I I II I I I II II I I I I I I I I II I I I II I I II I I I M M
Db        954 DTEKEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAY 1013

Qy       1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073
              I I II I I I II I I I I I I I I II I I I M II I I I I I M I I I I I I II I II I II I I I I II M II M I
Db       1014 IALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 1073

Qy       1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQI 1133
              1 I I 1 1 I M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II
Db       1074 IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQI 1133

Qy       1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
              M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 :
Db       1134 DHYLGLANKSVKDAMAKIQAKIPGLKRKAE 1163



RESULT 3 




Q8BGK7 




ID   Q8BGK7     PRELIMINARY;      PRT;  1046 AA.
AC   Q8BGK7;
DT   01-MAR-2003 (TrEMBLrel. 23, Created)
DT   01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE   RTN4.
GN   RTN4.
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129/SvcJ7, and 129SvcJ7;
RA   Oertle T., van der Putten H., Schwab M.E.;
RT   "Genomic Structure and Functional Characterization of the Promoter
RT   Structures of Human and Mouse Nogo/Rtn-4.";
RL   Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN   [2]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129/SvcJ7, and 129SvcJ7;
RA   Oertle T., Schwab M.E.;
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129/SvcJ7;
RA   Van der Putten H.;
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN   [4]
RP   SEQUENCE FROM N.A.
RC   STRAIN=129SvcJ7;
RA   Van der Putten H., Mir A.;
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR   EMBL; AY102280; AAM73502.1; -.
DR   EMBL; AY102286; AAM73507.1; -.
DR   MGD; MGI:1915835; Rtn4.
DR   GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR   GO; GO:0007399; P:neurogenesis; IDA.
DR   InterPro; IPR003388; Reticulon.
DR   Pfam; PF02453; Reticulon; 1.
DR   PROSITE; PS50845; RETICULON; 1.
SQ   SEQUENCE   1046 AA;  114221 MW;  8CE2E2238ED51222 CRC64;



Query Match 77.0%; Score 4501.5; DB 11; Length 1046; 

Best Local Similarity 86.9%; Pred. No. 4.8e-218; 

Matches 910; Conservative 40; Mismatches 68; Indels 29; Gaps 4;



Qy 143 PPPAGASPLAEPAAPPSTPA APKRRGSGSVDETLFALPAA 182
| | M I : I : I I I I I I I I I I I I
Db 3

Qy 183
I I I I I I I II I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I II I I II II I I
Db 63

Qy 243 LSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI 302
I I I I : I : I I I M I I I I I II : I I I I I I I M I I I I : I I I I I I I I I I I I I I I I I I I II I :
Db 123 LSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGESAM 182

Qy 303 LVENTKEEVIVRSKDKEDLVCSAALHSPQESP VGKEDRWS PEKTMDI FNEMQMS 357
I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I Ml I : I I I I I I I I I I M : I I
Db 183

Qy 358 WAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDS 417
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I : M I I I : M I I I I I I I I I I I
Db 243 WAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGKDS 302

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEE 477
I llhllll Ml III I II I I II Ihllllll II I I : I I I I I I I I I M I I I I II
Db 303 ESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKIEE 362

Qy 478
I I I I I I I I I I I I I II I : I I I I I I I I I I I I I I II I I : I I II I I I I I I I I I I I
Db 363 RKAQIITEKTSPKTSNPFLVAIHDSEADYWTDNLSKVTEAWATMPEGLTPDLVQEACE 422

Qy 538 SELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPL 597
I I I I I I I I I M I I I I I I I I I I I I I I I I I I : I I I M I I I II I M I I I I II II I I I I I I I I I
Db 423 SELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEAPL 482

Qy 598
I I I I I I I I I I III I I I I I I I I I I II I I I I I I I M I I I I : I I I I : I I Mil
Db 483 NSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIKEP 542

Qy 658 ESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSP 717
I M II I II II M I I I II M I II II II II II I : II I I M I II II I II I : I II I : I M I
Db 543 ESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSP 602

Qy 718 ESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPYLE 776
II || II II I II I II I II I II II II II II II I II II II I III : I II M II II : I I I II I
Db 603 ESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPYLE 662

Qy 777 SFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDS 836
It I II II II II II I : II II II I I II II II I II I II I II II II I II I II : II II II I I I
Db 663 SFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFSDS 722

Qy 837 SPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896
II || M II II II II I M II I I II II II II : M I I II : II I I : II II I II II I II I I
Db 723 SPIEIIDEFPTFVSAKDDSP KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKN 779

Qy 897 IYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDT 956
I I II I I I II I II : : II I II I : I I I I I I I I II : I I I I II II II : II II I I
Db 780 TYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDT 839

Qy 957 EKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 1016



II I I I I I • I M I M I • I I I t I I I I I I I I i i i i i t i i i i i 

Db 840 EKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 899 

Qy 1017 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 1076 

I I I I II I M I I I I I I II M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 900 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 959 

Qy 1077 LRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHY 1136 

I I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 960 LRRLFLVDDLVDSLKFAVLMWVFTWGALFT^GLTLLIIJ^lLISLFSIPVIYERHQAQIDHY 1019 

Qy 1137 LGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I II I I I M I I I I I I I I I I I I I I : 
Db 1020 LGLANKSVKDAMAKIQAKIPGLKRKAE 1046 

RESULT 4 
Q8IUA4 



ID Q8IUA4 PRELIMINARY; PRT; 986 AA. 

AC Q8IUA4; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE RNT4 (RTN4) . 

GN RTN4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis ; 

RX MEDLINE=22376540; PubMed=12488097 ; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 



RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL; AY102285; AAM64244.1; 

DR EMBL; AY123245; AAM64249.1; -. 

DR EMBL; AY123246; AAM64250.1; -. 

DR EMBL; AY123247; AAM64251.1; -. 

DR EMBL; AY123248; AAM64252.1; 

DR EMBL; AY123249; AAM64253.1; 



DR EMBL; AY123250; AAM64254.1; -. 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 986 AA; 108449 MW; 0CDE8F647036415A CRC64;



Query Match 62.0%; Score 3627.5; DB 4; Length 986; 

Best Local Similarity 75.4%; Pred. No. 3.9e-174;

Matches 745; Conservative 96; Mismatches 126; Indels 21; Gaps 13; 



Qy 195 MDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIE 254
Ml | I | | I I : I : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 : II 1 : :
Db 1 MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 60

Qy 255 ETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVR 314
1 : : I I I I I : 1 : 1 : : 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 III 1 1 1 : : 1 1 : 1 1 : 1 1 :
Db 61 ENVSEASKEVSEKAXTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 120

Qy 315 SKDKED-LVCSAALHSPQESPVG KEDRWSPEKTMDIFNEMQMSWAPVREEYAD 368
: M : | : | I : 1 1 : 1 1 1 1 1 1 1 II 1 1 1 1 1 1 : : : 1 1 1 : 1 II 1 1 1
Db 121 NKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 180

Qy 369 FKPFEQAWEVKDTYEGSRDVLAA RANVESKVDRKCLEDSLEQKSLGKDSEGRNEDA 424
1 1 1 1 1 : II 1 1 1 : 1 1 1 : 1 1 1 : 1 : M M 1 : M Mill: 1 1 1 1 1 : 1
Db 181 FKPFERVWEVKDSKEDS-DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDT 239

Qy 425 SFPSTPEPVKDSSRAYITCASFT-SATESTTANTFPLLEDHTSENKTDEKKIEERKAQII 483
1 1 1 1 1 1 1 : 1 1 1 II 1 1 1 1 1 : 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M : 1 1 1 1 :
Db 240 SFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIV 299

Qy 484 TEK-TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNE 542
Ml II 1 II 1 1 1 1 1 1 MM MUM 1 M 1 M 1: 1 1 II II II 1 1 M 1 1 1 1 1 1 M 1
Db 300 TEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNE 359

Qy 543 ATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLP 602
1 1 1 1 1 1 1 II 1 : 1 1 II II 1 M II 1 1 1 II II 1 1 1 1 1 : 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 M : 1
Db 360 VTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVP 419

Qy 603 SAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKAL-GTKEGIKEPESFN 661
1 M II 1 : 1 1 1 1 1 1 1 1 1 : 1 M 1 1 1 1 1 II 1 1 1 II II M M 1 : 1 II Mill: 1
Db 420 SAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPENIN 478

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721
M : 1 II 1 II II II II M II 1 1 1 1 1 II : 1 1 II : 1 1 1 : 1 1 M II : 1 : 1 1 1 1 1 M 1 : M 1
Db 479 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 538

Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVAQHK-EERLSASPQELGKPYLESFQ 779
II M II M II : M 1 1 : 1 1 II : 1 1 II II 1 1 : : :: : : 1 : 1 II 1 1 1 1 II 1 1 II :
Db 539 VDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFK 598

Qy 780 PNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSP 838
: 1 MM : : : II : II 1 1 1 Mill : 1 1 : M 1 1 II II 1 : 1 : 1 : 1 1 1 1 1 1 1 1
Db 599 LSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSP 658

Qy 839 IEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI 897
I I I I I I I I I : I : I II I I I : I I I II I I I I II I I I II I I I I III III Ml
Db 659 TrTTnpcnmT too inr^ c pcvt a Dr 1 VTTiT FV^HK'Ct'FT AN APnGAf^T.PCTFTiPHnT.S T.KNT 718

Qy 898 YPK--DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSD 955
|| ::: | | : | | : I I : I I : 1 : 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1
Db 719 QPKVEEKI SFSDDr oKNCjoAI oi\V XiXjIjit ir JJ voALiil y/vtiiJ- riiox vj\r r\v±j v j\i^tt-Cji\n.ijr ou 778

Qy 956 TEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 1015
1 1 1 1 1 1 ||: | M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M
Db 779 TEKEDRSPSAI F SAELoKl b V VULXj i WKJJX i\r\ i v r (a/\oXir XiXiXioXii vroivoviAi 838

Qy 1016 LALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIK 1075
I I I I I 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III
Db 839 XlAXjXjo V 1 Ibr KX i J\oV X^-ftX ^rvo lJrjori.tr r r\j-\I ±ji1>d i_> vai oiLiijXjV^rvitJiN o-ru-ivjn viN^i x x\ 898

Qy 1076 ELRRLFLVDDLVDSLKFAVLMWFTWGALFNGLTLLILALISLFSIPVIYERHQVQIDH 1135
M 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 II II II II 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 i 1 1 1 1 INI
Db 899 ELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDH 958

Qy 1136 YLGIANKSVKDAMAKIQAKIPGLKRKAD 1163
1 i 1 1 1 11 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 :
Db 959 YLGLANKNVKDAMAKIQAKIPGLKRKAE 986





RESULT 5 
Q7TNB7 

ID Q7TNB7 PRELIMINARY; PRT; 720 AA.

AC Q7TNB7; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RA Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC056373; AAH56373.1; 

KW Hypothetical protein. 

SQ SEQUENCE 720 AA; 77435 MW; 80AB78728F16EAB2 CRC64;

Query Match 56.4%; Score 3299.5; DB 11; Length 720; 

Best Local Similarity 90.3%; Pred. No. 7.9e-158; 

Matches 654; Conservative 22; Mismatches 39; Indels 9; Gaps 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 M II 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
I I I 1 1 1 1 1 III II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180
1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 M 1
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1
Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300
1 1 1 1 1 1 : 1 : 1 1 1 M 1 1 1 1 1 1 1 : 1 1 1 1 1 1 II 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1
Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESP VGKEDRWS PEKTMDI FNEMQ 355
1 : I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 Ml 1 : 1 1 1 1 1 1 1 1 1 1 1 1 :
Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMK 356

Qy 356 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415
I | | I || I I I I I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 : M 1 1 1 1 1 1 1 II
Db 357 MSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGK 416

Qy 416 DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 475
Ml llhllll IN 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 II 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 417 DSESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKI 476

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535
1 . I 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 t 1 1 . 1 1 1 1 1 t 1 1 II 1 1 1
M M M I 1 II II II 1 M 1 II II 1 • 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 • t i i i i < i i
Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEA 536

Qy 536 CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 595
I 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596

Qy 596 PLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 655



I I I I I I I I I I I I III I I I I I Mill I I I I I I II I I I I I I I : I I I I 



Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIK 656 



Qy 656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715 

I I I 1 I I I I II I I I I I I I I I I I I I I I 1 I I II I I I : I I I I I I I I I I I I I I I : I 111:11 
Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 716 SPES 719 

I I I I 

Db 717 SPES 720 



RESULT 6 
Q8K290 



ID Q8K290 PRELIMINARY; PRT; 639 AA. 

AC Q8K290; 

DT 01-OCT-2002 (TrEMBLrel. 22 , Created) 

DT 01-OCT-2002 (TrEMBLrel. 22 , Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC032192; AAH32192.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 309A19DA37603F11 CRC64;



Query Match 50.0%; Score 2926; DB 11; Length 639; 

Best Local Similarity 91.6%; Pred. No. 4e-139; 

Matches 588; Conservative 20; Mismatches 30; Indels 4; Gaps 2; 

Qy 523 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEA 582 

II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I : I I I I I I I I I I I I I I I 
Db 1 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEA 60 

Qy 583 TPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMN 642 

I I II I I 1 I I I I I I I I I I I I I I I I I I III INI I I II I I I I I I I I II I I I I I I I : 
Db 61 TPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMS 120 

Qy 643 VALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFE 702 

I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II : I I I I III I I I I 

Db 121 VALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFE 180 

Qy 703 KSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EE 761

II I I : I II I : I I I I I I II I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I III : I 

Db 181 KSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKE 240 



Qy 762 RLSASPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLS 821

I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I | | | | | | | | | 
Db 241 RLSASPQEVGKPYLESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLS 300

Qy 822 SKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGAD 881

I M 1 I : I I I I I I I I I I I I I I I I I I I I I I I | | | | | | I I I I I I II I : I I I I | | : | | | | : 
Db 301 SKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDSP KEYTDLEVSNKSEIANVQSGAN 357 

Qy 882 SLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKS 941 

UN I I I I I I I I I I I I I I I I I I I I I I :: I I I I I I : | | | | | | | | || : I I I 
Db 358 SLPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKP 417 

Qy 942 KSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLS 1001 

I I ! M I I : I I I I I II I I I I I I : I I I I I I I : I I I I I I I I I I | | | | | | | | | | M I I I II I I 
Db 418 KVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLS 477 

Qy 1002 LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQK 1061 

I I I I I I I I I i I I I I I I I M I I I I I I M I I I I I I I I I II II I I I I I I II I I I I M I I I I I I 
Db 478 LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQK 537 

Qy 1062 YSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFS 1121

I M I I I I I I I II I I I I I I I I I | | | | | | | | | | | | | || | | | | | | | || | | | | | | | | | | | | | | | 
Db 538 YSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFS 597 

Qy 1122 IPVIYERHQVQIDHYLGIANKSVKDAMAKIQAKIPGLKRKAD 1163

I I I I I I I M I I II I I I I I I I I I M I I I I I | | || | | | | | M : 
Db 598 IPVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAE 639 
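
The "Best Local Similarity" figures reported in the summary lines of this listing appear to equal the number of identical positions (Matches) divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). This is an inference from the reported numbers, not a documented definition of the search program; the function name below is hypothetical. A minimal Python sketch that recomputes the percentage from the reported counts:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are identical, to one decimal place.

    Assumption: every aligned column is counted exactly once as a match,
    a conservative substitution, a mismatch, or an indel.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts copied from the summary lines of RESULT 4, RESULT 6 and RESULT 8.
print(best_local_similarity(745, 96, 126, 21))  # reported: 75.4%
print(best_local_similarity(588, 20, 30, 4))    # reported: 91.6%
print(best_local_similarity(367, 2, 6, 788))    # reported: 31.6%
```

For RESULT 8 the 788 indel columns dominate the denominator, which is why a hit that is almost identical over its aligned residues still reports a similarity of only about 32%.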



RESULT 7 
Q80W95 
ID Q80W95 PRELIMINARY; PRT; 578 AA.

AC Q80W95;

DT 01-JUN-2003 (TrEMBLrel. 24, Created)

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Nogo-A (Fragment).

GN NOGO-A.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RA Tozaki H., Hirata T.;

RT "The partial sequence of mouse nogo-A cDNA clone#4109.";

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AB073672; BAC75974.1; -.

DR GO; GO:0005783; C:endoplasmic reticulum; IEA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

FT NON_TER 1 1

SQ SEQUENCE 578 AA; 63696 MW; 832670C171E4AC61 CRC64;



Query Match 44.6%; Score 2610; DB 11; Length 578;



Best Local Similarity 90.7%; Pred. No. 2.7e-123; 

Matches 527; Conservative 18; Mismatches 32; Indels 4; Gaps 2; 



Qy 584 PSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNV 643
1 1 1 1 1 1 1 1 1 1 1 1 II Ml I I I I | I | | | | Ml 11111:1
Db 1 PSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSV 60

Qy 644 ALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEK 703
Ml II 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 II 1 1 1 1 1 1 II 1 I I I | | | 1 1 1 1 1 1 1 1 | 1 1 I
Db 61 ALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEK 120

Qy 704 SVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EER 762
1 1 1 : 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 II 1 1 1 1 II 1 I I I I I | | | | | | | | | M 1 : 1 1
Db 121 SVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKER 180

Qy 763 LSASPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSS 822
1 1 M M 1 : M M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 : 1 II 1 II || 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I
Db 181 LSASPQEVGKPYLESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSS 240

Qy 823 KEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADS 882
1 i 1 1 : : 1 1 , 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 M : 1 1 1 1 M : 1 1 1 1 : 1
Db 241 KEDKMKESETFSDSSPIEIIDEFPTFVSAKDDSP---KEYTDLEVSNKSEIANVQSGANS 297

Qy 883 LPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSK 942
111 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 :: 1 1 1 1 I : 1 1 1 1 II 1 1 1 1 : 1 II 1
Db 298 LPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVFKVPLLLPNVSALESQIEMGNIVKPK 357

Qy 943 SLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSL 1002
1 1 1 1 II : 1 1 1 1 I 1 1 1 | 1 | | | : M 1 1 1 1 1 : 1 | | | | | | | | | I | | | | | | | | | | | | | | | | j | |
Db 358 VXTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLSL 417

Qy 1003 TVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKY 1062
1 1 i 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 II I M | || | || | | | | | | | || | | | | | | | | | | | | | | | | || |
Db 418 TVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKY 477

Qy 1063 SNSALGHWSTIKELRRLFLVDDLVDSLKF 1122
1 1 1 1 M 1 II 1 1 1 II 1 1 1 1 1 1 1 I I I I I | | | | | | | || | || | || | | | | | | | | | | | | | | | | | | |
Db 478 SNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTWGALFNGLTLLILALISLFSI 537

Qy 1123 PVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 || | | | | | | :
Db 538 PVIYERHQAQIDHYLGIANKSVKDAMAKIQAKIPGLKRKAE 578





RESULT 8 
Q8BHF5 

ID Q8BHF5 PRELIMINARY; PRT; 375 AA. 

AC Q8BHF5; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090;



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7;

RA Van der Putten H., Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY102282; AAM73504.1; 

DR EMBL; AY102286; AAM73509.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 375 AA; 40300 MW; 23D9EB19BE671AE6 CRC64; 

Query Match 24.2%; Score 1416; DB 11; Length 375; 

Best Local Similarity 31.6%; Pred. No. 1.7e-63; 

Matches 367; Conservative 2; Mismatches 6; Indels 788; Gaps 4 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
* ' ' i i i i i i i i i . i i I I I l l l l l l l l l I l I I I
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240
I I I I I I I I I I I
Db 177 AASEPVIPSSA 187

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Db 188 187

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360



Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Db 188 187 

QY 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Db 188 187

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Db 188 187

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 188 187 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 188 187 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Db 188 187

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 188 187 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Db 188 187

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Db 188 187

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Db 188 187 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I M II I I I I I I I I I I I 
Db 188 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 232 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

t I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I 1 | | || | | | | | | | | 

Db 233 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 292

Qy 1081 FLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGIA 1140

N I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I II I I I | | I I I I I I I I I 

Db 293 FLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLA 352 

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 
I I I I I I I I I I II I I I I I II I I I : 



Db 353 NKSVKDAMAKIQAKIPGLKRKAE 375



RESULT 9
Q8BH78

ID Q8BH78 PRELIMINARY; PRT; 356 AA.

AC Q8BH78;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE RTN4.

GN RTN4.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., van der Putten H., Schwab M.E.;

RT "Genomic Structure and Functional Characterization of the Promoter

RT Structures of Human and Mouse Nogo/Rtn-4.";

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7, and 129SvcJ7;

RA Oertle T., Schwab M.E.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RC STRAIN=129/SvcJ7;

RA Van der Putten H.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SEQUENCE FROM N.A.

RC STRAIN=129SvcJ7;

RA Van der Putten H., Mir A.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AY102281; AAM73503.1;

DR EMBL; AY102286; AAM73508.1;

DR MGD; MGI:1915835; Rtn4.

DR GO; GO:0005783; C:endoplasmic reticulum; IDA.

DR GO; GO:0007399; P:neurogenesis; IDA.

DR InterPro; IPR003388; Reticulon.

DR Pfam; PF02453; Reticulon; 1.

DR PROSITE; PS50845; RETICULON; 1.

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA9630B56 CRC64;



Query Match 22.5%; Score 1314.5; DB 11; Length 356; 

Best Local Similarity 29.9%; Pred. No. 2e-58; 

Matches 348; Conservative 2; Mismatches 6; Indels 807; Gaps 4 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I II I I I I I I I I I I I I I I I I I I || | | || I M | | : | | I I I I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 



Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120



1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

I I I I I II I I I 1 I I I I II I I I I I I I I I I I I II I I II I I I I I I I I I M I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSV 168

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 169 168 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Db 169 168 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

Db 169 168 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Db 169 168 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Db 169 168 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Db 169 168 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 169 168 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 169 168 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Db 169 168 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 169 168 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 169 168 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 169 168 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 



Db 169 168 



Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | I I I 
Db 169 VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 213 

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I t I I I I | I M I I II I I I I I I I I M I I I I 
Db 214 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 273 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

I I I M I I I I II I I I II I I I I I I I I I I I I I I I I I I I I | I | | | | | | M | | | | | | | | | | | M 
Db 274 FLVDDLVDSLKFAVLMWFTYVGALFNGLTLLIIALISLFSIPVIYERHQAQIDHYLGIA 333 

Qy       1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 
              ||||||||||||||||||||||: 
Db        334 NKSVKDAMAKIQAKIPGLKRKAE 356 



RESULT 10 
Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT; 357 AA. 

AC Q8K3G7; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-B. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W., Li R. , Long M. , Shen J., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY114153; AAM77069.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64; 

Query Match 22.3%; Score 1304; DB 11; Length 357; 

Best Local Similarity 29.9%; Pred. No. 6.6e-58; 

Matches 348; Conservative 2; Mismatches 6; Indels 808; Gaps 5; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I M I I I I I I I I I I I I I I | I I I I | I | | I | | | | | | | | : | | | | | | | | | | M | | | | | M 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

M I I I I I I III I I I I I I I If I I I I I I I I I | | | || | | | | | | | | M I I I I I I I I II II 



Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

I I I I I I I I I II I I I 1 I I I I II t I I I I I I II I I I I I I I I I I I M I I I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSV 168 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 169 168 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Db 169 168 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360 

Db 169 168 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Db 169 168 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Db 169 168 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Db 169 168 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 169 168 

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 169 168 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Db 169 168 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 169 168 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 169 168 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 169 168 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 169 168 



Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAYIALALL 1019 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 169 VVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAYIALALL 213 

Qy 1020 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRR 1079 

I I I I I II I I I I II I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 214 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRR 273 

Qy 1080 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGL 1139 

I M I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 274 LFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGL 333 

Qy       1140 ANKSVKDAMAKIQAKIPGLKRKAD 1163 
              |||||||||||||||||||||||: 
Db        334 ANKSVKDAMAKIQAKIPGLKRKAE 357 



RESULT 11 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA. 

AC Q96B16; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein (RTN4). 

GN RTN4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22376540; PubMed=12488097; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL J. Mol. Biol. 325:299-323(2003). 



DR EMBL; BC016165; AAH16165.1; -. 

DR EMBL; AY102285; AAM64242.1; -. 

DR EMBL; AY102278; AAM64247.1; -. 

DR GO; GO:0005783; C: endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64; 

Query Match 21.9%; Score 1283.5; DB 4; Length 392; 

Best Local Similarity 29.4%; Pred. No. 8.1e-57; 

Matches 346; Conservative 11; Mismatches 20; Indels 801; Gaps 7; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I : I I : : I I I I I I I II I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115 

I I I I I I I I II II I II I : I I : I I I I I I I I I I I I I I I I I II I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 



Db 179 RGSSGSVDETLFALPAASEPVIRSSA 204 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Db 205 204 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPE 345 

Db 205 204 

Qy 346 KTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLE 405 

Db 205 204 

Qy 406 DSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHT 465 

Db 205 204 

Qy 466 SENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPE 525 

Db 205 204 

Qy 526 GLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPS 585 

Db 205 204 

Qy 586 PVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVAL 645 

Db 205 204 



Qy 646 KALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSV 705 

Db 205 204 

Qy 706 PEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSA 765 

Db 205 204 

Qy 766 SPQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKED 825 

Db 205 204 

Qy 826 KIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885 

Db 205 204 

Qy 886 LELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLT 945 

Db 205 204 

Qy 946 KEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVF 1005 

I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I 
Db 205 VVDLLYWRDIKKTGVVFGASLFLLLSLTVF 234 

Qy 1006 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1065 

I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
Db 235 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 294 

Qy 1066 ALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVI 1125 

I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I : I I I 
Db 295 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 354 

Qy 1126 YERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

I I I I I I I II I I I I I I I : I I I II I I I I I I I I I I I I I I : 
Db 355 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 392 



RESULT 12 
Q7YRW9 

ID Q7YRW9 PRELIMINARY; PRT; 184 AA. 

AC Q7YRW9; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4w (Fragment) . 

GN RTN4 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288; 

RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 



RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164744; AAP47319.1; -. 

FT NON_TER 1 1 

SQ SEQUENCE 184 AA; 20671 MW; DE990E03BBAF84A1 CRC64; 



Query Match 15.0%; Score 878; DB 6; Length 184; 

Best Local Similarity 96.2%; Pred. No. 6.9e-37; 

Matches 177; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 

Qy        980 LYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 1039 
              ||||||||||||||  |||||||||||||||||||||||||||||||||||||||||||| 
Db          1 LYWRDIKKTGVVFGXXLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 60 

Qy       1040 EGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVF 1099 
              |||||||||||||||||||||||||||||||| ||||||||||||||||||||||||||| 
Db         61 EGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVF 120 

Qy       1100 TYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLK 1159 
              ||||||||||||||||||||||:|||||||| |||||||||||:|||||||||||||||| 
Db        121 TYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 180 

Qy       1160 RKAD 1163 
              |||: 
Db        181 RKAE 184 



RESULT 13 
Q7T224 

ID Q7T224 PRELIMINARY; PRT; 199 AA. 

AC Q7T224; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4-C. 

GN RTN4 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288; 

RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164737; AAP47312.1; -. 

SQ SEQUENCE 199 AA; 22293 MW; 07CF4E4EF2723251 CRC64; 

Query Match 15.0%; Score 876; DB 13; Length 199; 

Best Local Similarity 90.6%; Pred. No. 9.7e-37; 

Matches 173; Conservative 10; Mismatches 8; Indels 0; Gaps 0; 



Qy        973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 
              |  ||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db          9 KDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68 

Qy       1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 
              ||||||||||||||||||:||:||:|:||||:  |||:| |:|||||||||||||||||| 
Db         69 QAIQKSDEGHPFRAYLESDVAVSEDLIQKYSSVVLGHINGTVKELRRLFLVDDLVDSLKF 128 

Qy       1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152 
              1 I 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 M M 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 II | 1 1 1 
Db        129 AVIJVTWFTWGALFNGLTLLIIALISLFS 188 

Qy       1153 AKIPGLKRKAD 1163 
              ||||||||| : 
Db        189 AKIPGLKRKTE 199 


RESULT 14 
Q9GM33 

ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Osada N., Hida M., Kusuda J., Tanuma R., Iseki K., Hirai M., Terao K., 

RA Suzuki Y., Sugano S., Hashimoto K.; 

RT "Isolation of full-length cDNA clones from macaque brain cDNA 

RT libraries."; 

RL Submitted (OCT-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AB049853; BAB16739.1; -. 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 179 AA; 19949 MW; 5F8CD4383FEE9E02 CRC64; 

Query Match 14.4%; Score 844; DB 6; Length 179; 

Best Local Similarity 95.5%; Pred. No. 3.4e-35; 

Matches 171; Conservative 6; Mismatches 2; Indels 0; Gaps 0; 

Qy        985 IKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1044 
              :|||||||||||||||||||||||||||||||||||||||||||:||||||||||||||| 
Db          1 MKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYRGVIQAIQKSDEGHPF 60 

Qy       1045 RAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 
              ||||||||||||||||||||||||||| ||||||||||||||||||:||||||||||||| 
Db         61 RAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLRFAVLMWVFTYVGA 120 

Qy       1105 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 
              |||||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||: 
Db        121 LFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 179 

RESULT 15 
Q8K4S4 



ID Q8K4S4 PRELIMINARY; PRT; 780 AA. 

AC Q8K4S4; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Reticulon 1A. 

GN RTN1 OR RTN-1A. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ICR; TISSUE=Brain; 

RA Hirata T., Nomura T., Takagi Y. , Sato Y. , Tomioka N., Fujisawa H., 

RA Osumi N. ; 

RT "Mosaic development of the olfactory cortex with Pax6-dependent and - 

RT independent components."; 

RL Submitted (NOV-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AB074899; BAB96551.1; -. 

DR MGD; MGI:1933947; Rtn1. 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR InterPro; IPR001951; Histone_H4. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HISTONE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 780 AA; 83504 MW; 545F5638C576A069 CRC64; 



Query Match 13.5%; Score 792; DB 11; Length 780; 

Best Local Similarity 32.5%; Pred. No. 1.1e-31; 

Matches 242; Conservative 105; Mismatches 260; Indels 138; Gaps 27; 

Qy 517 EAAVSNMPEGLTPDLVQEACESELNEATG TKIAYETKVDLVQTSEAIQ-ESLY 568 

I I : I I I : : I I : I : : I : I : : | | : : 

Db 76 ETASTGM--AAVPDALDHSPSSTLKDGEGACYTSLISDVCYPPREDSAYFTGILQKENGH 133 

Qy 569 PTAQLCPSFEEAEATPSPVLPDIVMEAP LNS LLP SAGAS WQP SVS PLEAP P 620 

I I I I I I I I I I : : I I : I : I : I : I I : 

Db 134 ITTSESP--EEPE-TPGPSLPEVPGMEPQGLLSSDSGIEMTPAESTEVNKILADPLDQMK 190 

Qy 621 PVSYDSIKLE PENPPPYEEAMNVALKALGTK EGIKEPESFNAAV 664 

Ml: I I I : : I I I : I I : : I II 

Db 191 AEAYKYIDITRPQEAKGQEEQHPGLEDKDLDFKDKGTEVSTKAEGVRAPNQ-PAPVEGKL 249 

Qy 665 QETEAPYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEHAELV 712 

: I MM I :: ::| :| Ml ::::M M 

Db 250 IKDHLFEESTFAPYIDELSD--EQHRVSLVTAPVKITLTEIEPPLMTATQETIPEKQDLC 307 

Qy 713 EDS S PESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVAQHK EERLSASP 767 

M:: i I : M : I M I Ml M :M 



Db 308 LKPSPDTVPTVTVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIK 361 

Qy 768 QELGKPYLESFQPN LHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLS S 822 

: I i : I | : : M : III 
Db 362 EAKGLSYETTESPRPVGQVADKPKTKTRSGLPTIPS PLDQEASS 405 

Qy 823 KEDKIKESETFSDSSPIEIIDEFPT FVSAKDDSPKLAK-EYTDLEVSDKSEIAN 875 

I I I I : I : II: II II : I : I : : | : 

Db 406 AESGDSEIELVSE-DPMASEDALPSGYVSFGHVSGPPPSPASPSIQYSILREEREAEL-- 462 

Qy 876 IQSGADSLPCLELPCDL-SFKNIYPKDEVH VSDEFSENRSSVSKASISPSNVS 927 

II : I I I I III I I I : : I I : 

Db 463 DSELIIE-SCDASSASEESPKREQDSPPMKPGALDAIREETGSRATEERAPSHQG 516 

Qy 92 8 ALEPQTEMGSIVKSKSLTKEAEKKL PSDTEKEDRSLSAVLSAE 970 

: I I II : : I I I : : : : III:: 

Db 517 PVEPD-PMLSFAPAAALQSRPEPSSGDGASVPEPPRSQQQKPEEEAVSSSQSPTATEIPG 575 

Qy 971 LSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALAL 1018 

: I : I I I I I I II I : I I : II I : I II I I I I I : M I II : I I I 
Db 576 PLGSGLMPPLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAA 635 

Qy 1019 LSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHWSTIKELR 1078 

II I I I I II I I I : I I : I I : I I I I I I : I I I I I : : I : I : I I I : : : I I I I : I I I I 

Db 636 LSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELR 695 

Qy 1079 RLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLG 1138 

I I I I I I I I I I I I I I I I I I : I I I I I I I I II I I I : : I : : I : I : : I I : I : I I I : I III 
Db 696 RLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAWSMFTLPVVYVKHQAQVDQYLG 755 

Qy 1139 LANKSVKDAMAKIQAKIPGLKRKAD 1163 

I : : I I I I I I I I I I I I : 

Db 756 LVRTHINTWAKIQAKIPGAKRHAE 780 

Search completed: September 29, 2004, 18:19:39 
Job time : 98.4948 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on:          September 29, 2004, 18:04:48 ; Search time 16.6724 Seconds 
                 (without alignments) 
                 3632.211 Million cell updates/sec 

Title:           US-09-830-972-2 
Perfect score:   5848 
Sequence:        1 MEDIDQSSLVSSSTDSPPRP...VKDAMAKIQAKIPGLKRKAD 1163 

Scoring table:   BLOSUM62 
                 Gapop 10.0 , Gapext 0.5 

Searched:        141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 141681 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
                 Maximum Match 100% 
                 Listing first 45 summaries 

Database:        SwissProt 42:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
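
The Query Match figures reported for each hit appear to be the hit score expressed as a percentage of the perfect score (the score of the query aligned against itself). That relation is not stated by the program; it is inferred from the reported numbers and is an assumption here. A minimal sketch, using the hypothetical helper name `query_match_percent`:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Return a hit score as a percentage of the perfect score.

    Assumption: Query Match = 100 * Score / Perfect score, rounded to one
    decimal place. This is inferred from the report, not documented in it.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


# "Perfect score" as reported in the search header of this run.
PERFECT_SCORE = 5848

# Scores of the first three summary hits in this run.
for score in (5848, 4403.5, 917):
    print(score, query_match_percent(score, PERFECT_SCORE))
```

Under this assumption the three scores give 100.0, 75.3 and 15.7, which agrees with the Query Match column of the summaries.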



SUMMARIES 



Result           Query 
   No.   Score   Match  Length  DB  ID            Description 
     1    5848   100.0    1163   1  RTN4_RAT      Q9jk11 rattus norv 
     2  4403.5    75.3    1192   1  RTN4_HUMAN    Q9nqc3 homo sapien 
     3     917    15.7     199   1  RTN4_MOUSE    Q99p72 mus musculu 
     4     801    13.7     777   1  RTN1_RAT      Q64548 rattus norv 
     5   789.5    13.5     776   1  RTN1_HUMAN    Q16799 homo sapien 
     6   625.5    10.7     236   1  RTN3_HUMAN    O95197 homo sapien 
     7   625.5    10.7     237   1  RTN3_MOUSE    Q9es97 mus musculu 
     8     509     8.7     471   1  RTN2_MOUSE    O70622 mus musculu 
     9     490     8.4     545   1  RTN2_HUMAN    O75298 homo sapien 
    10   308.5     5.3    5147   1  PCLO_HUMAN    Q9y6v0 homo sapien 
    11     300     5.1    5120   1  PCLO_CHICK    Q9pu36 gallus gall 
    12   295.5     5.1     865   1  CPN_DROME     Q02910 drosophila 
    13   291.5     5.0    2459   1  MAPB_RAT      P15205 rattus norv 
    14     288     4.9    2468   1  MAPB_HUMAN    P46821 homo sapien 
    15     284     4.9    2464   1  MAPB_MOUSE    P14873 mus musculu 
    16   282.5     4.8    2805   1  MAPA_HUMAN    P78559 homo sapien 
    17     279     4.8    5038   1  PCLO_MOUSE    Q9qyx7 mus musculu 
    18     277     4.7    3644   1  MINT_MOUSE    Q62504 mus musculu 
    19   275.5     4.7    3924   1  ANK2_HUMAN    Q01484 homo sapien 
    20     275     4.7    1972   1  P5 3 INHUMAN  Q12888 homo sapien 
    21   272.5     4.7    3664   1  MINT_HUMAN    Q96t58 homo sapien 
    22     272     4.7    5085   1  PCLO_RAT      Q9jks6 rattus norv 
    23     267     4.6    1781   1  AK12_HUMAN    Q02952 homo sapien 
    24   265.5     4.5    1828   1  MAP2_MOUSE    P20357 mus musculu 
    25   261.5     4.5    1087   1  NFH_MOUSE     P19246 mus musculu 
    26   259.5     4.4    1026   1  NFH_HUMAN     P12036 homo sapien 
    27   259.5     4.4    2404   1  SON_MOUSE     Q9qx47 mus musculu 
    28   258.5     4.4    1861   1  MAP2_RAT      P15146 rattus norv 
    29   257.5     4.4    2715   1  MLL4_HUMAN    Q9umn6 homo sapien 
    30     256     4.4    1723   1  AIM1_HUMAN    Q9y4k1 homo sapien 
    31   255.5     4.4    1140   1  YM96_YEAST    Q04893 saccharomyc 
    32   254.5     4.4    6632   1  UN89_CAEEL    O01761 caenorhabdi 
    33     254     4.3    1616   1  P200_MYCGE    Q49429 mycoplasma 
    34     254     4.3    3381   1  PGCV_BOVIN    P81282 bos taurus 
    35     253     4.3    3421   1  TEGU_HSVEB    P28955 equine herp 
    36   252.5     4.3    1411   1  TCOF_HUMAN    Q13428 homo sapien 
    37     252     4.3    2738   1  PGCV_RAT      Q9erb4 rattus norv 
    38   250.5     4.3    4377   1  ANK3_HUMAN    Q12955 homo sapien 
    39   248.5     4.2    1189   1  YJH6_YEAST    P47035 saccharomyc 
    40     248     4.2    1827   1  MAP2_HUMAN    P11137 homo sapien 
    41   245.5     4.2    1001   1  IF2_SYNY3     P72689 synechocyst 
    42     244     4.2    1338   1  ACIN_MOUSE    Q9jix8 mus musculu 
    43   241.5     4.1    8545   1  ANC1_CAEEL    Q9n4m4 caenorhabdi 
    44   238.5     4.1     831   1  NFH_RAT       P16884 rattus norv 
    45   238.5     4.1    1341   1  ACIN_HUMAN    Q9ukv3 homo sapien 



ALIGNMENTS 



RESULT 1 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9; Q9WUF0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut 4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE. 

RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte; 

RX MEDLINE=99249816; PubMed=10231557; 

RA Morris N.J., Ross S.A., Neveu J.M., Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family."; 

RL Biochim. Biophys. Acta 1450:68-76(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1, 2 AND 3). 

RX MEDLINE=20129258; PubMed=10667796; 

RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M., Schnell L., 

RA Spillmann A.A., Christ F., Schwab M.E.; 

RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an 

RT antigen for monoclonal antibody IN-1."; 

RL Nature 403:434-439(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4) . 

RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP FUNCTION. 

RX MEDLINE=22033691; PubMed=12037567; 

RA GrandPre T., Li S., Strittmatter S.M.; 

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration."; 

RL Nature 417:547-551(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=1; Synonyms=Nogo-A, NI-220-250; 

CC IsoId=Q9JK11-1; Sequence=Displayed; 

CC Name=2; Synonyms=Nogo-B, Foocen-M1; 

CC IsoId=Q9JK11-2; Sequence=VSP_005658; 

CC Name=3; Synonyms=Nogo-C, VP20; 

CC IsoId=Q9JK11-3; Sequence=VSP_005656, VSP_005657; 

CC Name=4; Synonyms=Foocen-M2; 

CC IsoId=Q9JK11-4; Sequence=VSP_005659; 

CC -!- TISSUE SPECIFICITY: Isoforms 1, 2 and 3 are present in optic 
CC nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 

CC present in dorsal root ganglion, sciatic nerve and PC12 cells 

CC after longer exposure. Isoforms 2 and 3 are detected in kidney, 

CC cartilage, skin, lung and spleen. Isoform 3 is expressed at high 

CC level in skeletal muscle. In adult animals isoform 1 is expressed 

CC mainly in the nervous system. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF051335; AAF01564.1; -. 

DR EMBL; AJ242961; CAB71027.1; 

DR EMBL; AJ242962; CAB71028.1; 



DR EMBL; AJ242963; CAB71029.1; 

DR EMBL; AF132045; AAD31019.1; 

DR EMBL; AF132046; AAD31020.1; 

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane 

DR GO; GO:0005635; C:nuclear membrane; ISS. 

DR GO; GO:0005515; F:protein binding; ISS. 

DR GO; GO:0019987; P:negative regulation of anti-apoptosis; ISS 

DR GO; GO:0030517; P:negative regulation of axon extension; ISS 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT DOMAIN        1    989  CYTOPLASMIC (Potential). 

FT TRANSMEM    990   1010  POTENTIAL. 

FT DOMAIN     1011   1104  LUMENAL (Potential). 

FT TRANSMEM   1105   1125  POTENTIAL. 

FT DOMAIN     1126   1163  CYTOPLASMIC (Potential). 

FT DOMAIN      976   1163  RETICULON. 

FT DOMAIN       33     46  POLY-GLU. 

FT DOMAIN       73     76  POLY-ALA. 

FT DOMAIN      140    145  POLY-PRO. 

FT VARSPLIC      1    964  Missing (in isoform 3). 

FT                         /FTId=VSP_005656. 

FT VARSPLIC    965    975  AVLSAELSKTS -> MDGQKKHWKDK (in isoform 

FT                         3). 

FT                         /FTId=VSP_005657. 

FT VARSPLIC    173    975  Missing (in isoform 2). 

FT                         /FTId=VSP_005658. 

FT VARSPLIC    192    975  Missing (in isoform 4). 

FT                         /FTId=VSP_005659. 

FT CONFLICT   1130   1131  MISSING (IN REF. 3; AAD31020). 

SQ SEQUENCE 1163 AA; 126386 MW; 8CB894B09E94F0B6 CRC64; 





Query Match 100.0%; Score 5848; DB 1; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 3.9e-213; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Qy        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy        181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db        181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Qy        241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db        241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 



Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163

        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163



RESULT 2 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA. 

AC Q9NQC3; O94962; Q9BXG5; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7;

AC Q9Y5U6; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Neuroendocrine-specific protein) (NSP) (Neuroendocrine specific

DE protein C homolog) (RTN-x) (Reticulon 5) (My043 protein).

GN RTN4 OR NOGO OR ASY OR KIAA0886.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1, 2 AND 3).

RX MEDLINE=20129242; PubMed=10667780;

RA Prinjha R., Moore S.E., Vinson M., Blake S., Morrow R., Christie

RA Michalovich D., Simmons D.L., Walsh F.S.;

RT "Inhibitor of neurite outgrowth in humans.";

RL Nature 403:383-384(2000).

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2).

RC TISSUE=Brain;

RX MEDLINE=21010696; PubMed=11126360;

RA Tagami S., Eguchi Y., Kinoshita M., Takeda M., Tsujimoto Y.;

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on

RT endoplasmic reticulum and reduces their anti-apoptotic activity.";

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1, 2 AND 3).

RX MEDLINE=20237542; PubMed=10773680;

RA Yang J., Yu L., Bi A.D., Zhao S.-Y.;

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome

RT 2p14-->2p13 by radiation hybrid mapping.";

RL Cytogenet. Cell Genet. 88:101-102(2000). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4).

RA Jin W.-L., Ju G.;

RT "Developmentally-regulated alternative splicing in a novel Nogo-A

RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3).

RC TISSUE=Placenta, and Skeletal muscle;

RA Ito T., Schwartz S.M.;

RT "Cloning of a member of the reticulon gene family in human.";

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2).

RC TISSUE=Fibroblast;

RA Yutsudo M.;

RT "Isolation of a cell death-inducing gene.";

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases.

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RC TISSUE=Pituitary;

RA Song H., Peng Y., Zhou J., Huang Q., Dai M., Mao Y.M., Yu Y., Xu X.,

RA Luo B., Hu R., Chen J.;

RT "Human neuroendocrine-specific protein C (NSP) homolog gene.";

RL Submitted (JUL-1998) to the EMBL/GenBank/DDBJ databases.

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y., 

RA Yu J., Han L.H.;

RT "Novel human cDNA clone with function of inhibiting cancer cell

RT growth.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases.

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RC TISSUE=Brain;

RX MEDLINE=99156230; PubMed=10048485;

RA Nagase T., Ishikawa K.-I., Suyama M., Kikuno R., Hirosawa M.,

RA Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3).

RC TISSUE=Brain, Ovary, Pancreas, Placenta, and Skeletal muscle;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RX MEDLINE=20499367; PubMed=11042152;

RA Zhang Q.-H., Ye M., Wu X.-Y., Ren S.-X., Zhao M., Zhao C.-J., Fu G.,

RA Shen Y., Fan H.-Y., Lu G., Zhong M., Xu X.-R., Han Z.-G., Zhang J.-W.,

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z.;



RT "Cloning and functional analysis of cDNAs with open reading frames for 

RT 300 previously undefined genes expressed in CD34+ hematopoietic 

RT stem/progenitor cells."; 

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4). 

RC TISSUE=Brain; 

RA Mao Y.M., Xie Y., Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases.

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1). 

RC TISSUE=Testis; 

RA Sha J.H., Zhou Z.M., Li J.M.;

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases.

RN [14] 

RP TOPOLOGY. 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=10667797;

RA GrandPre T., Nakamura F., Vartanian T., Strittmatter S.M.;

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain; 

RX MEDLINE=21069055; PubMed=11201742;

RA Fournier A.E., Grandpre T., Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 

RT regeneration."; 

RL Nature 409:341-346(2001).

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=11891768;

RA Ng C.E.L., Tang B.L.;

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron

RT regeneration.";

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults. 

CC Isoform 2 reduces the anti-apoptotic activity of Bcl-xl and Bcl-2. 

CC This is likely consecutive to their change in subcellular 

CC location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=1; Synonyms=RTN 4A, Nogo-A, RTN-xL;

CC IsoId=Q9NQC3-1; Sequence=Displayed;

CC Name=2; Synonyms=RTN 4B, Nogo-B, RTN-xS, Foocen-M;

CC IsoId=Q9NQC3-2; Sequence=VSP_005655;

CC Name=3; Synonyms=RTN 4C, Nogo-C, Foocen-S;

CC IsoId=Q9NQC3-3; Sequence=VSP_005652, VSP_005653;

CC Name=4;

CC IsoId=Q9NQC3-4; Sequence=VSP_005654;
CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain
CC and testis and weakly in heart and skeletal muscle. Isoform 2 is
CC widely expressed except for the liver. Isoform 3 is expressed in
CC brain, skeletal muscle and adipocytes. Isoform 4 is testis-
CC specific.

CC -!- SIMILARITY: Contains 1 reticulon domain.

CC -!- CAUTION: Ref.11 sequence differs from that shown due to
CC frameshifts in positions 1149 and 1156.

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
AJ251383  CAB99248
AJ251384  CAB99249
AJ251385  CAB99250
AB040462  BAB18927
AB040463  BAB18928
AF148537  AAG12176.1
AF148538  AAG12177.1
AF087901  AAG12205
AF320999  AAG40878
AF132047  AAD31021
AF132048  AAD31022
AB015639  BAA83712
AF077050  AAD27783
AF177332  AAG17976
AB020693  BAA74909
BC001035  AAH01035
BC007109  AAH07109
BC014366  AAH14366.1



Query Match 75.3%; Score 4403.5; DB 1; Length 1192;

Best Local Similarity 75.9%; Pred. No. 9.1e-159;

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20;



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

     |||:||| ||||| |||||| |||||||| |||||| |||||:||::|||||||||||||
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58



Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

      |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||:| || :   |
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

       |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225

       || ||||||||||||||||||| ||||  ||| ||||||:|:||||||||||||||||||
Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237



Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285



Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG-----KED 339

I I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I : I i I III 

Db 298 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357

Qy 340 RVVSPEKTMDIFNEMQMSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA----RANV 395

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : i I I : I I I : I : 

Db 358 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454

11111:11 Mill: I I I I I : I I I I I I 1 I : I I I I I I I I I I : I II I 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I I I I I I I I I I I I II I I I I 1 : I I I I : I I I II I I I I I I I I I I! I I I I 1 I I I I : 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I I : I I I I I I I I I I I I I I I I I I I I I 1111111111:1111111 : I I I I I I III 

Db 537 KVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633

I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I I I I 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692

I I I I I I I I : I : I I : I II Mill: I I I : I I I I I I I I I I I I I I I II I I I I I I : I I I 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752

I : I I I : I I I : I I : I : I I I I M I I : M I I I I I I I I I I I : I I I hi I I I : I I I II I I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I : : : : : : I : I I I I I I I I I I I I I : : I : I I I : : : I I : I I I I I II I II 

Db 776 FESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835

Qy 810 NTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVS 868

: I I : I I I I I I III : I : I : I I I I I I I I I I I I I I I I I : I : I II I I t : I I I I I I I I 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNV 926

I I I I I I II I I I I I I I I I I I I I I I : : : I I : I I : I I : I I : I : I 

Db 896 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986

III I I : I I I I I I I I I I I I I II I I I I I I I I II: I I I I I I I I I I I II I I I I I I 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015

Qy  987 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1046

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1016 KTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1075



Qy 1047 YLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106

        ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||

Db 1076 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135



Qy 1107 NGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163

        |||||||||||||||:||||||| ||||||||||||:|||||||||||||||||||:
Db 1136 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 3 
RTN4_MOUSE 

ID RTN4_MOUSE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein).

GN RTN4 OR NOGO.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3T3-L1; TISSUE=Adipocyte;

RA Coulson A.C., Craggs P.D., Morris N.J.;

RT "Mouse vp20/RTN4C cDNA.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Embryo; 

RX MEDLINE=21085660; PubMed=11217851;

RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S.,

RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,

RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,

RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J.,

RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,

RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,

RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,

RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P.,

RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N.,

RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,

RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S.,

RA Hayashizaki Y.;

RT "Functional annotation of a full-length mouse cDNA collection.";

RL Nature 409:685-690(2001).

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity).

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the
CC membrane of the endoplasmic reticulum through 2 putative
CC transmembrane domains (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=1;
CC Comment=A number of isoforms may be produced;
CC Name=1;
CC IsoId=Q99P72-1; Sequence=Displayed;
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC

DR EMBL; AF326337; AAK08076.1; -.
DR EMBL; AK003859; -; NOT_ANNOTATED_CDS.
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; ISS.
DR GO; GO:0005635; C:nuclear membrane; ISS.
DR GO; GO:0005515; F:protein binding; ISS.
DR GO; GO:0019987; P:negative regulation of anti-apoptosis; ISS.
DR GO; GO:0030517; P:negative regulation of axon extension; ISS.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.
FT DOMAIN 1 25 CYTOPLASMIC (Potential).
FT TRANSMEM 26 50 POTENTIAL.
FT DOMAIN 51 137 LUMENAL (Potential).
FT TRANSMEM 138 162 POTENTIAL.
FT DOMAIN 163 199 CYTOPLASMIC (Potential).
FT DOMAIN 12 199 RETICULON.
SQ SEQUENCE 199 AA; 22466 MW; 07BE5D580059ED9C CRC64;



Query Match 15.7%; Score 917; DB 1; Length 199;

Best Local Similarity 97.9%; Pred. No. 1.5e-28;

Matches 187; Conservative 1; Mismatches 3; Indels 0; Gaps 0;



Qy  973 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032

        |  |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    9 KDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 128

Qy 1093 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 1152

        |||||||||||||||||||||||||||||||||||||| |||||||||||||||||||||
Db  129 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQ 188

Qy 1153 AKIPGLKRKAD 1163

        ||||||||||:
Db  189 AKIPGLKRKAE 199



RESULT 4 
RTN1_RAT

ID RTN1_RAT STANDARD; PRT; 777 AA.

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specific protein) (S-rex).

GN RTN1 OR NSP.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-B AND RTN1-S).

RC STRAIN=Wistar; TISSUE=Brain cortex;

RX MEDLINE=96386034; PubMed=8793864;

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M.,

RA Georgiev G.P., Buchman V.L.;

RT "Intracellular compartmentalization of two differentially spliced s-

RT rex/NSP mRNAs in neurons.";

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane (By 

CC similarity).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=RTN1-B; Synonyms=S-RexB;

CC IsoId=Q64548-1; Sequence=Displayed;

CC Name=RTN1-S; Synonyms=S-RexS;

CC IsoId=Q64548-2; Sequence=VSP_005647, VSP_005648;

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN. 

CC EXPRESSION OF ISOFORM RTN1-B IS RESTRICTED TO PARTICULAR NEURONAL

CC TYPES.

CC -!- DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE

CC HINDBRAIN AND IN E11 IN THE FOREBRAIN. DURING THE NEXT 3 EMBRYONIC

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U17604; AAC53046.1; -. 

DR EMBL; U17603; AAC53045.1; -. 



DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.
FT TRANSMEM 604 624 POTENTIAL.
FT TRANSMEM 727 747 POTENTIAL.
FT DOMAIN 590 777 RETICULON.
FT DOMAIN 610 613 POLY-LEU.
FT VARSPLIC 1 569 Missing (in isoform RTN1-S).
FT /FTId=VSP_005647.
FT VARSPLIC 570 589 IPGPLGSDLVPPLPFFNKQK -> MQATADSTKMDCVWSNW
FT KSQ (in isoform RTN1-S).
FT /FTId=VSP_005648.
SQ SEQUENCE 777 AA; 83001 MW; AF7479C50F28D0AC CRC64;



Query Match 13.7%; Score 801; DB 1; Length 777; 

Best Local Similarity 32.4%; Pred. No. 2e-23; 

Matches 258; Conservative 109; Mismatches 264; Indels 166; Gaps 32; 

Qy 485 EKT S P KT SN P FLVAVQD S E ADYVTTDTL SKVTEAAVSNMPEGL — TPD 530 

I : : i I : I I I I I I I : : I : : I : I : I I 

Db 29 EEATPKGARP AQQDGEPAWGSGAGAGWSSRGLCSGPARSPPVAMETASTGVAAVPD 85

Qy 531 LVQEACESELNEATG TKIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEA 582 

: : I : I : I I : I : : I I : : I I I 
Db 86 ALDHSSSPTLKDGEGACYTSLISDICYPPREDSAYFTGILQKENGHITTSESP EELG 142 



Qy 583 TPSPVLPDIVMEAPLNSLLPSAGASWQPS VSPLEAPPPVSYDSIKLE 630 

Mill:: |:||| : I : | : : | | I : I I 

Db 143 TPGPSLPEVPGTEP-HGLLSSDSGIEMTPAESTEVNKILADP LDQMKAEACKYIDIT 198 



Qy 631 PENPPPYEEAMNVALK ALGTK-EGIKEPE SFNAAVQETE 668 

I | I : : | : I I II : I I : I 

Db 199 RPQEAKGQEEQSPGLEDKDLDFKDKDSEVSTKPEGVHAPNQPSPVEGKLIKDNLFEESTF 258 



Qy 669 APYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESEPVD 723 

I II I I :: ::| :| :M :::|| :| ||:: I 

Db 259 APYIDELSD — EQHRMSLVTAPVKITLTEIGPPVMTATHETIPEKQDLCLKPSPDTVPTV 316 



Qy 724 LFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK EERLSASPQELGKPYLESF 778 

I : I : : I I I I : I I : I : : I : I I : 

Db 317 TVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIKEAKGLSYETTE 370 

Qy 779 QPN LHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKIKESETF 833 

I | | : : | | : I I I I I II 

Db 371 S P RP VGQAAD RP KVKARS GL P T I P S SLDQEAS SAESGDSEIELV 414 

Qy 834 SDSSPIEIIDEFPT FVSAKDDSPKLAK-EYTDLEVSDKSEIANIQSGADSLPCL 886 

I : I : II: II II : I : I : : I : II : 

Db 415 S E- DPMAS EDALP S GYVS FGHVS GP P P S PAS P S I Q YS I LREEREAEL DSELII 466 

Qy 887 ELPCDL-SFKNIYPKDEVH VSDEFS ENRS SVSKAS ISP SNVSALEPQTEM 935 

I II I Ml II III: : I: :M : 

Db 467 E-SCDASSASEESPKREQDSPPMKPGVLDAIREETSSRATEERAPSHQGPVEPDPILSFT 525

Qy 936 GSIV KSKSLTKEAE KKLPSDTEKEDRSLSAVLSAEL — 971 

I : I II: I I : I : I I : I : : I 



Db 



526 PVTLQSRPEPSSGDGAPVPEPPKSQQQKPEEEAVSSSQSPAATE IPGPLGSDLVP 580 



Qy 972 -----SKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFR 1026

: I : I I I I I I I II : I I : I I 1 : I II III I I : I I I I I : I I I II I I I I I 
Db 581 PLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYIALAALSATISFR 640

Qy 1027 IYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDL 1086

III I : I I : I I : I I I I I I : I II I I : : I : I : I I I : : : I I I I : II I I I I I I I II 

Db 641 IYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDL 700

Qy 1087 VDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKD 1146

I I I I I I I I I I I : I I I I I I I I M I I I : : I : : I : I : : I I : I : I I I : I I I I I : 
Db 701 VDSLKFAVLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQVDQYLGLVRTHINT 760

Qy 1147 AMAKIQAKIPGLKRKAD 1163

: II I I I I I I I II I : 
Db 761 VVAKIQAKIPGAKRHAE 777



RESULT 5 
RTN1_HUMAN 

ID RTN1_HUMAN STANDARD; PRT; 776 AA.

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specific protein).

GN RTN1 OR NSP.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A, RTN1-B AND RTN1-C).

RC TISSUE=Lung carcinoma;

RX MEDLINE=93293865; PubMed=7685762;

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V.,

RA Ramaekers F.C.S., Van de Ven W.J.M.;

RT "Cloning and expression of alternative transcripts of a novel

RT neuroendocrine-specific gene and identification of its 135-kDa

RT translational product.";

RL J. Biol. Chem. 268:13439-13447(1993).

RN [2] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=96429995; PubMed=8833145;

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K.,

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.;

RT "Genomic organization of the human NSP gene, prototype of a novel gene

RT family encoding reticulons.";

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=9560466;

RA Hens J., Nuydens R., Geerts H., Senden N.H., Van de Ven W.J.M.,

RA Roebroek A.J., van de Velde H.J.K., Ramaekers F.C., Broers J.L.;

RT "Neuronal differentiation is accompanied by NSP-C expression.";

RL Cell Tissue Res. 292:229-237(1998).



CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane.

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=3;

CC Name=RTN1-A; Synonyms=NSP-A;

CC IsoId=Q16799-1; Sequence=Displayed;

CC Name=RTN1-B; Synonyms=NSP-B;

CC IsoId=Q16799-2; Sequence=VSP_005644;

CC Name=RTN1-C; Synonyms=NSP-C;

CC IsoId=Q16799-3; Sequence=VSP_005645, VSP_005646;

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES 
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTN1-C 

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION. 

CC -!- PTM: Isoforms RTN1-A and RTN1-B are phosphorylated. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; L10333; AAA59950.1; -. 

DR EMBL; L10334; AAA59951.1; -. 

DR EMBL; L10335; AAA59952.1; -.

DR PIR; A46583; A46583.

DR PIR; I60904; I60904.

DR Genew; HGNC:10467; RTN1.

DR MIM; 600865; -.

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; TAS.

DR GO; GO:0004871; F:signal transducer activity; NAS.

DR GO; GO:0030182; P:neuron differentiation; TAS.

DR GO; GO:0007165; P:signal transduction; NAS.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Phosphorylation. 



FT TRANSMEM 603 623 POTENTIAL.

FT TRANSMEM 726 746 POTENTIAL.

FT DOMAIN 589 776 RETICULON.

FT DOMAIN 609 612 POLY-LEU.

FT VARSPLIC 1 420 Missing (in isoform RTN1-B).

FT /FTId=VSP_005644.

FT VARSPLIC 1 568 Missing (in isoform RTN1-C).

FT /FTId=VSP_005645.

FT VARSPLIC 569 588 GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW

FT KSQ (in isoform RTN1-C).

FT /FTId=VSP_005646.

SQ SEQUENCE 776 AA; 83617 MW; CA5B6232353096FE CRC64;



Query Match 13.5%; Score 789.5; DB 1; Length 776;

Best Local Similarity 31.2%; Pred. No. 5.3e-23;

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28;



Qy 487 TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 546 

: I : I I I :: : : I : | | | : | : | : 
Db 65 SGPARQSP — VAMETASTGVAGVS SAMDHTFSTTSKDGEG SCYTSLI S 110 

Qy 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI — VMEAPLNS 599 

I I : I : : | | : : | I I I I I I I I : : | | 

Db 111 DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGIESRGLFSSDSG 167 

Qy 600 --LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

: I : I : I I : : I : : I : : : | | : 

Db 168 IEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGIKEPE SFNAAVQETEAPYISIACDLIKETKLSTE-PSP 690 

: : : : I | | : : | | : : : I I I I I | | : | : : : : | 

Db 228 DISIK PEGVREPDKPAPVEGKIIKDHLLEESTFAPYID DLSEEQRRAPQITTP 280 

Qy 691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSIPEV PQTQEEAVMLMK 745 

: I I I : : : | | : : | | | : : | | | : : 

Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

Qy 746 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792 

I I I I : I I : I : : I : I I : I I : : 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGPPT 394 

Qy 793 IPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI 827 

II: II:: : I I : : I I : 

Db 395 IPSPLDHEASSAESGDSEIELVSEDPM — AAEDALPSGYVSFGHVGGPPPSPASPSIQYS 452 

Qy 828 KESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 879 

: I I II I : I I I I : : I I I : : : : I I : 

Db 453 ILREEREAELDS ELIIESCDASSASEESPKREQDSPPMKPSALDAI REETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

II II INN I I I Nihil 
Db 510 APSRRGLAEPG — SFLD-YPSTEPQPGPEL PPGDGALEPETPM 549 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984

II I : : I : : : | : | : I I I I I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 IKKTGWFGASLFLLLSLTVFSIVSVTAYIAIALLSVTISFRIYKGVIQAIQKSDEGHPF 1044 

I I : I I : I I I : I II III I I : II I I I : I I 1 I I I I I I M I I I : II : I I : I i I I I I 
Db 598 I KQTGI VFGS FLLLLFSLTQFS WSWAYLALAALSATI S FRI YKSVLQAVQKTDEGHPF 657 

Qy 1045 RAYLESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

= M I I I : : I : I : I I I : : : I I I I : I I I I t I I I I I I I I I I I I I I I I I : Mill 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGIANKSVKDAMAKIQAKIPGLKRKM 1163 

I I I I M I I : : I : : I : I : : I I : I : I I III I I I I : : I I I I I I I M II I : 

Db 718 LFNGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 776 

RESULT 6 
RTN3_HUMAN 

ID RTN3_HUMAN STANDARD; PRT; 236 AA.

AC O95197;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Reticulon protein 3 (Neuroendocrine-specific protein-like 2) (NSP-like

DE protein II) (NSPLII).

GN RTN3 OR NSPL2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Retina; 

RX MEDLINE=99265974; PubMed=10331947;

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.;

RT "Cloning of a novel member of the reticulon gene family (RTN3): gene

RT structure and chromosomal localization to 11q13.";

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Du G., Yuan J., Qiang B.;

RT "Cloning and expression analysis of a cDNA encoding a novel

RT neuroendocrine-specific protein-like protein 1: NSPL1.";

RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential).

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF059524; AAC99319.1; -.

DR EMBL; AF059529; AAD20951.1; -.

DR EMBL; AF059525; AAD20951.1; JOINED.

DR EMBL; AF059526; AAD20951.1; JOINED.

DR EMBL; AF059527; AAD20951.1; JOINED.

DR EMBL; AF059528; AAD20951.1; JOINED.

DR EMBL; AF119297; AAD26810.1; -.

DR EMBL; BC000634; AAH00634.1; -.

DR EMBL; BC010556; AAH10556.1; -.

DR EMBL; BC011394; AAH11394.1; -.

DR EMBL; BC022993; AAH22993.1; -.

DR Genew; HGNC:10469; RTN3.

DR MIM; 604249; -.

DR GO; GO:0005615; C:extracellular space; TAS.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 88 POTENTIAL. 

FT TRANSMEM 177 197 POTENTIAL. 

FT DOMAIN 48 236 RETICULON.

SQ SEQUENCE 236 AA; 25609 MW; DDC6A4544ABCDFB7 CRC64; 



Query Match 10.7%; Score 625.5; DB 1; Length 236;

Best Local Similarity 59.1%; Pred. No. 1.7e-17;

Matches 114; Conservative 41; Mismatches 37; Indels 1; Gaps 1;

Qy 972 SKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031

I : I I I I I I : I I I I III : I : I I I I I I : : I I : I : I I I I I I I I I I I I I I I
Db 44 SSCAVHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRIYKSV 103

Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091

I I I : I I I : I I I I I : I I I : : : : I I I I : I : I : I : I : I II I I : I I I I I I I

Db 104 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLK 163

Qy 1092 FAVLMWFTWGALFNGLTLLILALISLFSIPVIYERHQV^ 1151

II II: I I II I : I I I : I I I I I I : : I I : I : : I I : : : I I I I I : I : I I : II

Db 164 LAVFMWLMTYVGAVFNGITLLILAELLIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKI 223

Qy 1152 QAKIPGL-KRKAD 1163

I I I : I I : I : I I :
Db 224 QAKLPGIAKKKAE 236



RESULT 7 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 



DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Qiang H., Yuan J., Qiang B.;

RT "Cloning and expression profile of a novel mouse cDNA encoding a human

RT RTN3 homolog.";

RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Eye; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential).

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF195940; AAG31360.1; -. 

DR EMBL; BC014697; AAH14697.1; -. 

DR MGD; MGI:1339970; Rtn3.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 89 POTENTIAL.

FT TRANSMEM 167 187 POTENTIAL.

FT DOMAIN 49 237 RETICULON.

SQ SEQUENCE 237 AA; 25428 MW; EB60A0A7AC45F0DE CRC64; 



Query Match 10.7%; Score 625.5; DB 1; Length 237; 

Best Local Similarity 59.1%; Pred. No. 1.7e-17; 

Matches 114; Conservative 41; Mismatches 37; Indels 1; Gaps 1; 

Qy 972 SKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIAIALLSVTISFRIYKGV 1031 

I : I I I :: I I I : I I II III : I : I I I I I I : : I I : I : I I I I I I I I I I I : I I I 
Db 45 S SCAVHDLI FWRDVKKTGFVFGTTLIMLLSLAAFSVI SWS YLI LALLSVTI SFRVYKSV 104 

Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091 

I I I : I I I : I I I I I : I I I : : : : I I I I : I : I I I : I : I II I I : I I I | I I I 

Db 105 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVliWKALKLIIRLFLVEDLVDSLK 164 

Qy 1092 FAVLMWFTWGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGIJ\NKSVKDAMAKI 1151 

II II: I I I I I : I I I : I I I M I : : I I : I : : I I : : : I I I I I : I : I I : M 

Db 165 LAVFMWLMTYVGAVFNGITLLILAELLVFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKI 224 

Qy 1152 QAKIPGL-KRKAD 1163

I I I : I I : I : I I : 
Db 225 QAKLPGIAKKKAE 237 



RESULT 8 
RTN2_MOUSE

ID RTN2_MOUSE STANDARD; PRT; 471 AA.

AC O70622; O70620;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-MAR-2004 (Rel. 43, Last annotation update)

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like

DE protein 1) (NSPLI).

GN RTN2 OR NSPLI.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle; 

RX MEDLINE=98191726; PubMed=9530622;

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RC TISSUE=Retina;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound. Endoplasmic reticulum 

CC (Potential).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=1; Synonyms=Brain;

CC IsoId=O70622-1; Sequence=Displayed;

CC Name=2; Synonyms=Muscle;

CC IsoId=O70622-2; Sequence=VSP_005650, VSP_005651;

CC -!- TISSUE SPECIFICITY: Expressed predominantly in neural and muscular 
CC tissues. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF038537; AAC14906.1; -.

DR EMBL; AF038537; AAC14907.1; -.

DR EMBL; AF038538; AAC14908.1; -.

DR EMBL; AF038539; AAC14909.1; -.

DR EMBL; AF093624; AAD13195.1; -.

DR EMBL; BC031370; AAH31370.1; -.

DR MGD; MGI:107612; Rtn2.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 295 315 POTENTIAL. 

FT DOMAIN 272 471 RETICULON. 

FT VARSPLIC 1 267 Missing (in isoform 2).

FT /FTId=VSP_005650.

FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2).

FT /FTId=VSP_005651. 

SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 



Query Match 8.7%; Score 509; DB 1; Length 471; 

Best Local Similarity 28.6%; Pred. No. 1e-12;

Matches 146; Conservative 84; Mismatches 175; Indels 106; Gaps 13; 



Qy 690 PDFSNYSEIAKFEKSVPEHAELVEDSSP — ESEPVDLFSDDSIPEVPQTQEEAVMLMKES 747 

I I : : I I I : I II I II : I 
Db 6 PVFAHCKEAPSTASSTPDSTEGGNDDSDFRELHTAREFSED 46 

Qy 748 LTEVSETVAQH KEERLSASPQELGKPYLESFQPNLHST 785

I I I : I : : : | : | : | | | 

Db 47 — EEEETTSQDWGTPRELTFSYIAFDGWGSGGRRDSWRRPRPQGRSVSEPRDPPQQSG 104 

Qy 786 KDAASNDI PTLTKK EKISLQMEEFNTAI YSNDDLLSSKEDK 826 

: I I : I : : I : : I : : : : : | | | | 

Db 105 LGDSLESIPSLSQSPEPGRRGDPDPVPPAERPLEELRLRLDQLGWWRS AGSGED- 159 

Qy 827 IKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 

I I : I : I : I I : I : I I I : I I : : I I 
Db 160 SATSSSTPLE--NEEPDGLEASE AGEETNLEL RLAQ SL 195 

Qy 887 ELPCDLSFKNIYPKDEVHVSDEFSENRSSVS KASISPSNVSALEPQTEMGSIVKSKSLTK 946 

I : : : I : I I I I : I : I I : : : : : I 

Db 196 HLQLEVLTPQLSPSSGTPQAHTPSPQRSQDSNS--GPDDEPLLNVVEEHWRLLEQEPITA 253 

Qy 947 EAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFS 1006 

: I : : : I : I I I I I I I : I : : I I I : III II 

Db 254 QC LDSTDQSEFMLEPLL LVADLLYWKDTRTSGAVFTGLMASLLCLLHFS 302 

Qy 1007 IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 1066 

I I I I I :: I I I | | | | : | : | : | | : : | : | | : | | | : : : : : | : : I 
Db 303 IVSVAAHLALLGLCATISLRVYRKVLQAVHRGDGTNPFQAYLDMDLTLTREQTERLSQQI 362 

Qy 1067 LGHWSTIKELRRLFLVDDLVDSLKFAVLMWFTWGALFNGLTLLILALISLFSIPVIY 1126 

II II : I I I I I : I I I I I I I hi : : I : I I I : I I I I I I : I I : : : I. I : : I : : I 

Db 363 ASHWSTATQLRHFFLVEDLVDSLKLALLFYILTFVGAI FNGLTLVILGWALFTVPLLY 422 

Qy 1127 ERHQVQI DH YLGLANKSVKDAMAKI QAKI PG 1157 

: I I I I I I : I I : I II : I I I I I 

Db 423 RQHQAQI DQYVGLVTNQLSHI KAKI RAKI PG 453 



RESULT 9 
RTN2_HUMAN 

ID RTN2_HUMAN STANDARD; PRT; 545 AA. 

AC O75298; O60509;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like

DE protein 1) (NSPLI).

GN RTN2 OR NSPLI.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY. 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=98360096; PubMed=9693037;

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.;

RT "cDNA cloning, genomic organization, and expression of the human RTN2

RT gene, a member of a gene family encoding reticulons.";

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A. (ISOFORM RTN2-B).

RC TISSUE=Brain;

RX MEDLINE=98191726; PubMed=9530622;

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.;

RT "Molecular cloning of a novel mouse gene with predominant muscle and

RT neural expression.";

RL Mamm. Genome 9:274-282(1998).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=RTN2-A;

CC IsoId=O75298-1; Sequence=Displayed;

CC Note=Isoform RTN2-C is produced by alternative initiation at

CC Met-341 of isoform RTN2-A;

CC Name=RTN2-B;

CC IsoId=O75298-2; Sequence=VSP_005649;

CC Event=Alternative initiation;

CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced

CC by alternative initiation at Met-1 and Met-341;

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL
CC MUSCLE.

CC -!- SIMILARITY: Contains 1 reticulon domain.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; -. 

DR EMBL; AF038540; AAC14910.1; -. 

DR Genew; HGNC:10468; RTN2.

DR MIM; 603183; -.

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; NAS.

DR GO; GO:0004871; F:signal transducer activity; NAS. 

DR GO; GO:0007165; P:signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 



DR PROSITE; PS50845; RETICULON; 1.

KW Endoplasmic reticulum; Alternative splicing; Transmembrane;

KW Alternative initiation.

FT CHAIN 1 545 RETICULON PROTEIN 2, ISOFORM RTN2-A.

FT CHAIN 341 545 RETICULON PROTEIN 2, ISOFORM RTN2-C.

FT INIT_MET 341 341 FOR ISOFORM RTN2-C.

FT TRANSMEM 368 388 POTENTIAL.

FT TRANSMEM 463 483 POTENTIAL.

FT DOMAIN 345 545 RETICULON.

FT VARSPLIC 272 344 Missing (in isoform RTN2-B).

FT /FTId=VSP_005649.





SQ SEQUENCE 545 AA; 59263 MW; 971FD2F909E1E9E6 CRC64; 



Query Match 8.4%; Score 490; DB 1; Length 545; 

Best Local Similarity 28.3%; Pred. No. 6.3e-12; 

Matches 160; Conservative 76; Mismatches 202; Indels 128; Gaps 17;

Qy 668 EAPYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESEPVDL 724 

III I I I I I I : : I : : I : I : : I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEEDEEETTSQDWGTPRELTFSY 66 

Qy 725 FSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQPNLH 783 

: I : : : : III I I : I I : : I I : I : I 

Db 67 IAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESI — PSLSQSPEPGRR 124 

Qy 784 STKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIID 843 

II I : I : I : : : I I I : I I : I : I 
Db 125 GDPDTAP PSERPLEDLRLRLDHLGWVARGT GSGED SSTSSSTPLE 169 

Qy 844 EFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEV 903 

1:1 : I I I : I MM: I 

Db 170 DEEP QEPNRLETGEAGE EL — DLRLRLAQPSSPE 201 

Qy 904 HVSDEFSENRSSVSKASISP SNVSALEPQTE 934 

: : : I : : I I II III 

Db 202 VLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPLEREPVRGQCLDSTDQ 261 

Qy 935 MGSIVK — SKSLTKEAEKKLP S 954 

: I : : : II I : I 

Db 262 LEFTVEPRLLGTAMEWLKTSLLLAVYKTVPILELSPPLWTAI GWVQRGPTPPTPVLRVLL 321

Qy 955 DTEKEDRSLSAVLSAELSK TSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVS VT 1011 

I I I I I I I : I I II I I : I : : I I I I : Ml I I I I I I 

Db 322 KWAKSPRS-SGVPSLSLGADMGSKVADLLYWKDTRTSGWFTGLMVSLLCLLHFSIVSVA 380 

Qy 1012 AYIALALLS VTISFRIYKGVI QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVN 1071 

I : : I I I I I I I | : | : | : M : : I : II : I II : : : : : I : : I : I 
Db 381 AHLALLLLCGTISLRVYRKVLQAVHRGDGANPFQAYLDVDLTLTREQTERLSHQITSRW 440 

Qy 1072 STIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQV 1131 

I : I I I II : II I I I I I I : I : : I : I II : I II I II I II : I || : | | : : | : I I 
Db 441 SAATQLRHFFLVEDLVDSLKLALLFYILTFVGAI FNGLTLLILGVIGLFTIPLLYRQHQA 500 

Qy 1132 QIDHYLGLANKSVKDAMAKIQAKIPG 1157 

IMIMI : I I I : I I II I 

Db 501 QIDQYVGLVTNQLSHIKAKIRAKIPG 526 



RESULT 10 
PCLO_HUMAN 

ID PCLO_HUMAN STANDARD; PRT; 5147 AA. 

AC Q9Y6V0; O43373; O60305; Q9BVC8; Q9UIV2; Q9Y6U9;
DT 28-FEB-2003 (Rel. 41, Created) 
DT 28-FEB-2003 (Rel. 41, Last sequence update) 
DT 10-OCT-2003 (Rel. 42, Last annotation update) 
DE Piccolo protein (Aczonin) (Fragments). 
GN PCLO OR ACZ OR KIAA0559. 
OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE OF 1-759 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=10508862;

RA Wang X., Kibschull M., Laue M.M., Lichte B., Petrasch-Parwez E.,

RA Kilimann M.W.;

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP SEQUENCE OF 552-4404 FROM N.A. 

RA Kraemer J., Wollam C., Wohldmann P., McGrane B.;

RL Submitted (DEC-1999) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 3619-5147 FROM N.A. (ISOFORM 2). 

RC TISSUE=Brain;

RX MEDLINE=98290545; PubMed=9628581;

RA Nagase T., Ishikawa K.-I., Miyajima N., Tanaka A., Kotani H.,

RA Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 5:31-39(1998). 

RN [4] 

RP SEQUENCE OF 4405-4439 FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [5] 

RP SEQUENCE OF 4405-5147 FROM N.A. 

RA Kalicki J., Elliott G.; 

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: May act as a scaffolding protein involved in the 

CC organization of synaptic active zones and in synaptic vesicle 



CC trafficking (By similarity).

CC -!- SUBUNIT: Interacts with Rabac1/Pra1, RIMS2 and profilin (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of
CC synaptic junctions (By similarity).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Comment=Additional isoforms seem to exist;

CC Name=1;

CC IsoId=Q9Y6V0-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q9Y6V0-2; Sequence=VSP_003923, VSP_003924, VSP_003925,

CC VSP_003926, VSP_003927;

CC Note=No experimental confirmation available; 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Y19188; CAB60727.1; -. 

DR EMBL; AC004903; AAD20936.1; -.

DR EMBL; AC004886; AAD21789.1; -. 

DR EMBL; AB011131; BAA25485.1; -. 

DR EMBL; BC001304; AAH01304.1; -. 

DR EMBL; AC004082; AAB97937.1; -. 

DR PIR; T00634; T00634. 

DR HSSP; P04410; 1A25. 

DR Genew; HGNC:13406; PCLO.

DR MIM; 604918; -.

DR GO; GO:0005856; C:cytoskeleton; NAS.

DR GO; GO:0045202; C:synaptic junction; ISS.

DR GO; GO:0005509; F:calcium ion binding; ISS.

DR GO; GO:0005544; F:calcium-dependent phospholipid binding; ISS.

DR GO; GO:0005522; F:profilin binding; ISS.

DR GO; GO:0007010; P:cytoskeleton organization and biogenesis; ISS.

DR GO; GO:0016080; P:synaptic vesicle targeting; ISS.

DR InterPro; IPR000008; C2.

DR InterPro; IPR001565; Synaptotagmin.

DR PRINTS; PR00360; C2DOMAIN.

DR PRINTS; PR00399; SYNAPTOTAGMN.

DR SMART; SM00239; C2; 2.

DR PROSITE; PS00499; C2_DOMAIN_1; 1.

DR PROSITE; PS50004; C2_DOMAIN_2; 2.

KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger;

KW Repeat; Alternative splicing.

FT NON_TER 1 1

FT DOMAIN 400 465 10 X 10 AA TANDEM APPROXIMATE REPEATS OF

FT P-A-K-P-Q-P-Q-Q-P-X.



FT ZN_FING 499 523 C4-TYPE (POTENTIAL).

FT ZN_FING 969 992 C4-TYPE (POTENTIAL).

FT NON_CONS 1010 1011

FT DOMAIN 2300 2325 POLY-PRO.

FT DOMAIN 4391 4442 PDZ.

FT DOMAIN 4544 4633 C2 DOMAIN 1.

FT DOMAIN 5031 5121 C2 DOMAIN 2.

FT VARSPLIC 4404 4404 S -> SGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAE

FT QTGKLMEG (in isoform 2).

FT /FTId=VSP_003923.

FT VARSPLIC 4534 4534 K -> KPTDGTKWSHPITGEIQ (in isoform 2).

FT /FTId=VSP_003924.

FT VARSPLIC 4576 4576 G -> GQVMWQNAS (in isoform 2).

FT /FTId=VSP_003925.

FT VARSPLIC 4757 4761 TAHKS -> SKRRK (in isoform 2).

FT /FTId=VSP_003926.

FT VARSPLIC 4762 5147 Missing (in isoform 2).

FT /FTId=VSP_003927.

SQ SEQUENCE 5147 AA; 563537 MW; CD5D84990498CD3C CRC64;



Query Match 5.3%; Score 308.5; DB 1; Length 5147; 

Best Local Similarity 21.4%; Pred. No. 0.00076; 

Matches 264; Conservative 150; Mismatches 438; Indels 383; Gaps 63; 

Qy 3 DIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERKPA 62 

II: I I I I : I I : : I : : : I : || 
Db 251 DIVRGESVKPSLPSPSKPP IQQPTPGKPPAQQPGHEKSQPG PAKPP 296



Qy 63 AGLSAAAVP PAAAAPLLDFSSDSVPPA-PRGPLPAAP PA A 101 

II I I I : : III I I II III I 

Db 297 AQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLG — PAKPPAQQTGSEKPSSEQPGPKA 354 

Qy 102 PERQPSWERSPA-APAPSLP PAAAVLPSKLPEDDEPPA 138 

: I :M I I I: I III hi I :| |: 

Db 355 LAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPAQTKPPSQQP 414 

Qy 139 RPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIM 195 

:IM I I I I I II :: II I |:| : III: 
Db 415 GSTKPPPQQPGPAKP — SPQQPGSTKPPSQQPGS AKPSAQQ PSPAKPSA 461 

Qy 196 DLMEQ PGNTVS S GQEDFP S VLLETAAS LPS LS PL- STVS FKEH-GYLGNLSAVS 247 

: I : I : I : : I II : I I : I I I : : 

Db 462 QQFTKPVSQTGFGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTTELLLHVPEKANFNTCT 521 



Qy 248 SSEGTI EETLNEASKEL PERATNPFVNRDL 277 

: I : I I I : I I : I I 

Db 522 ECQTTVCSLCGFNPNPHLTEAKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVTTTSA 581 

Qy 278 AEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGK 337

I : : : II : : II : I : : I I : : I 

Db 582 VSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPE7VDSLSK 641 

Qy 338 EDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVES 397 

: I I : : 1 III I I I I : I III: I : : I : 

Db 642 P APPKEPSVPSEQDK APV ADDKP-KQPKMVKPTTD LVSSSSATTKP 686



Qy 398 KVDRKCLEDSLEQKS LGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FTSATESTT 454



: MM III :: III I I I | | 

Db 687 DIPSSKVQSQAEEKTTPPLKTDSAKPSQ--SFPPTGEKV TPFDSKAIPRP 734 

Qy 455 ANTFPLLE — DHTSENKTD EKKIEERKAQIITEKTSPK 490 

M : : : I I : I : I I I : I I I I I I I 

Db 735 ASDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQ TKMSPKPDAKPMPKGSPTPPGP 791 

Qy 491 — TSNPFLVAVQDS EADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEAT 544 

I : : I I I : | : | : | | | | | | 
Db 792 RPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQ PTTPQET VT 838

Qy 545 GTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVNEAPLNSLLPSA 604 

I : : | | | : | | : | : : I I I I :: 

Db 839 GKLFGFGASI - FSQASNLI STAGQPG PHSQSGPGAP MKQAPAPSQPPTS 886 

Qy 605 GASWQPSVSPLEAPP PVSYD SIKLEP ENPPPYEE 639 

I I : I I I II: : I I I I : I | | : : 

Db 887 QG P P KS T GQAP P AP AKS I P VKKET KAP AAEKLE P KAEQAP T VKRT ET E KK P P P I KD 942 

Qy 640 AMNVALK ALGTK EGIKEPESFNAAVQ E 666 

: =: : I I I I |:| :|| : 

Db 943 SKSLTAEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGF 1002 

Qy 667 TEAPYISIAC DLIK ETKLSTEPSPDFSNYSEIAKFEKSVPEHA 709 

I : : : I hi I I I I : I : : : I I 

Db 1003 NPTPHLTENCQTQRAISGQLGDIRKMPPAPSGPKAS PMPVP TESSSQKTAVPPQV 1057 

Qy 710 ELV EDSSPESEPV DLFSDDSIPEV PQTQEEAVMLMKE S LTEVS ETVA 756 

: II : : | : | | : I : I I : MM: I I : : : I 

Db 1058 KLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEES-KLEKDKASALQEKKP 1116 

Qy 757 QHKEERLSAS PQEL GKPYLES FQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTA 812 

: I : : I : : : I I I I : I I I : I I I : 
Db 1117 LPEEKKLIPEEEKIRSEEKKPLLEEKKPTPEDKK LLPEAKTSAPEEQ 1163 

Qy 813 IYSNDDLLSSK EDKI KESETFSDSSPIEIIDEFPTFVSAKDDS- 855 

MM: MM I : : I : I I : : II 

Db 1164 KHDLLKSQVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLP SGTPQSLPKEDDKT 1216 

Qy 856 PKLAKEYTDLEVSDKSEIANIQ SGADSLPCLELPCD 891 

III : I MM :: | | | |:| 

Db 1217 TKTIKEQPQPPCTAKPDQEKE-DDKSDTSSSQQPKSPQGLSDTGYSSDGISSSLGEIP — 1273 

Qy 892 LSFKNIYPKDEVHV SDEFSENRSSVSK ASI SPSNVSALEPQTEMGSIVKSKS 943 

: M I I : II I : I I I : : : I I I I : I I : 

Db 1274 SLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSEK 1329 

Qy 944 LTK — EAEKKLPSDTEKEDRSLSAVLSAELSKTSV 976 

I : I Mill MM I : I : : 
Db 1330 KTQPHEVSPEQPKDQEK-TQSLSETLEITISEEEI 1363 
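
The "Best Local Similarity" figure in the summary block of this hit appears to be the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). That definition is an assumption inferred from the reported numbers; the search output does not state it. A minimal sketch in Python, using a hypothetical helper name and the counts printed for this hit (Matches 264; Conservative 150; Mismatches 438; Indels 383):

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all alignment columns.

    Assumed definition: the program's documentation is not quoted in
    this output, so this is a plausibility check, not its algorithm.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the summary line of the hit above.
print(f"{best_local_similarity(264, 150, 438, 383):.1f}%")  # prints 21.4%
```

The result agrees with the 21.4% reported for this hit, and the same arithmetic also reproduces the values reported for the following hits.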



RESULT 11 
PCLO_CHICK 

ID PCLO_CHICK STANDARD; PRT; 5120 AA. 

AC Q9PU36; 

DT 28-FEB-2003 (Rel. 41, Created) 



DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Piccolo protein (Aczonin) (Fragment).
GN PCLO OR ACZ.
OS Gallus gallus (Chicken).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A.
RC TISSUE=Brain;
RX MEDLINE=99439764; PubMed=10508862;
RA Wang X., Kibschull M., Laue M.M., Lichte B., Petrasch-Parwez E.,
RA Kilimann M.W.;
RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active
RT zones, shares homology regions with rim and bassoon and binds
RT profilin.";

RL J. Cell Biol. 147:151-162(1999). 

CC -!- FUNCTION: May act as a scaffolding protein involved in the
CC organization of synaptic active zones and in synaptic vesicle
CC trafficking (By similarity).
CC -!- SUBUNIT: Interacts with Rabac1/Pra1 and profilin (By similarity).
CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of
CC synaptic junctions (By similarity).

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Y19187; CAB60725.1; 

DR HSSP; P04410; 1A25. 

DR GO; GO:0045202; C:synaptic junction; ISS.
DR GO; GO:0005509; F:calcium ion binding; ISS.
DR GO; GO:0005544; F:calcium-dependent phospholipid binding; ISS.
DR GO; GO:0005522; F:profilin binding; ISS.
DR GO; GO:0007010; P:cytoskeleton organization and biogenesis; ISS.
DR GO; GO:0016080; P:synaptic vesicle targeting; ISS.
DR InterPro; IPR000008; C2.
DR InterPro; IPR001478; PDZ.
DR InterPro; IPR001565; Synaptotagmin.
DR InterPro; IPR008899; Znf_piccolo.
DR Pfam; PF00168; C2; 2.
DR Pfam; PF00595; PDZ; 1.
DR Pfam; PF05715; Zf_piccolo; 2.
DR PRINTS; PR00399; SYNAPTOTAGMN.
DR SMART; SM00239; C2; 2.
DR SMART; SM00228; PDZ; 1.
DR PROSITE; PS00499; C2_DOMAIN_1; 1.
DR PROSITE; PS50004; C2_DOMAIN_2; 2.
DR PROSITE; PS50106; PDZ; 1.
KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger;
KW Repeat.
FT NON_TER 1 1
FT 258 357 10 X 10 AA TANDEM APPROXIMATE REPE,
FT P-A-K-P-Q-P-Q-Q-P-X.
FT ZN_FING 368 392 C4-TYPE (POTENTIAL).
FT ZN_FING 836 859 C4-TYPE (POTENTIAL).
FT DOMAIN 2324 2343 POLY-PRO.
FT DOMAIN 4414 4493 PDZ.
FT DOMAIN 4627 4726 C2 DOMAIN 1.
FT DOMAIN 5003 5094 C2 DOMAIN 2.
SQ SEQUENCE 5120 AA; 560751 MW; A658D9891B65B412 CRC64;

Query Match 5.1%; Score 300; DB 1; Length 5120;

Best Local Similarity 20.4%; Pred. No. 0.0016;

Matches 258; Conservative 152; Mismatches 464; Indels 388; Gaps 56;

Qy 14 TDSPPRPPPAFKYQFVTEPEDEEDEEE EEDEEEDDEDLEELEVLERKPAAGLSAA 68 

: I I I : I | | : | : | | : ::::::::: | | : 

Db 90 SDSDAAHEEAGRKQKVTQKEQGKPEEQRGLAKHPSQ'QQSPKLVQQQGPVKPTPQQTESSK 149 

Qy 69 AVP PAAAAPLLDFSSDSV — PPAPRGP LPAAP PAAPERQPSWE 109 

II I : I I :: I I I I I I I I : I I : I : 

Db 150 PVPQQQQPGEPKQGQKPGPSHP-GDSKAEQVKQPPQPRGPQKSQLQQSEPTKPGQQQTSA 208 

Qy 110 RSPAAPAPSLP — PAAAVLPSKLPEDDEP PAR P 140 

: : I I I I I : I I : I : I | | | | 

Db 209 KTSAGPTKPLPQQPDSAKTSSQAPPPTKPSLQQSGSVKQPSQQPARQGGPVKPSAQQAGP 268 

Qy 141 PPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDET — LFALPAASEPVIPSS — AEKIMD 196 

I I II: I II I : : | : : | : | | | | | 

Db 269 PKQQPGSEKPTAQQTGPAKQPPQP-GPGKTPLQQTGPVKQVPPQAGPTKPSSQTAGAAKS 327 

Qy 197 LMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEET 256 

I : I I I I I I : I I : : I I I : : I 

Db 328 LAQQPGLTKPPGQQPGPEKPLQQKQA STTQPVESTPKKT 366 

Qy 257 L NEASKELPERAT NPFVNRDLAEFSE LEYS 286 

I I I : I 111:11 :: : 

Db 367 FCPLCTTTELLLHTPEKANYNTCTQCHTWCSLCGFNP — NPHITEI KEWLCLNCQMQRA 424 

Qy 287 EMGS S FKGS PKGESAI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PVGKEDRWS PE- 345 

I II : I I I I I I I I I : I I : 

Db 425 LGGDLAS GHGPGPQ LPPPKQKTPTPASTAKPSPQLQPGQKKDASPKPDP 473 

Qy 346 KTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESK 398

I : : I II:: : I : : : I : | | : | 
Db 474 SQQADSKKPVPQKKQPSMPGSPPVKSKQTHAEPSDTGQQI-DSTPKSDQVKPTQA 527 

Qy 399 VDRKCLEDSLEQKSLGKD SEGRNEDASFPSTPEPVKDSSRAYITCAS FTSA 449 

I: I I: I : | :| : | :| |:: :| : 

Db 528 EEKQNQPSIQKPTMDTVPTSAAPGVKQDLADPQSP STQQKVTDSPMPET 576 



Qy 450 TESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSP KTSNPFLV 497



Db 577 TK PPADTHPAGDKPDSKPL PQVSRQKSDPKLASQSGAKSDAKTQKPSEP 625 

Qy 498 A-VQDSEADYVT TDTLSKVTEAAVSNMPE 525

I 1:1 I | : | | : I: 

Db 626 APVKDDPKKLQTKPAPKPDTKPAPKGPQAGTGPRPTSAQPAPQPQQPQKTPEQSRRFSLN 685 

Qy 526 --GLT PDLVQEACESEL NEA TGTKIAYETKVDLVQTSEAIQ 564

Ml IM :l ::| I : : I :: I 

Db 686 LGGITDAPKPQPTTPQETVTGKLFGFGASIFSQASSLISTAGQPGSQTSGPAPPATKQPQ 745 

Qy 565 ESLYPTAQLCPSFEEAEATPSP-VLPDIVMEAPLNS — LLPSAGASWQPSVSPLEAPPP 621 

M llhlll I I I I I I I | : I I I I 

Db 746 PPSQPPASQAPPKEAAQAQPPPKAAPTKKETKPLASEKLGPMASDSTLTTKGSDLEKKPS 805

Qy 622 VSYDSIKLEPENPPPYE EAMNVALKALGTKEGIKEPESFNAA 663 

= = I I I II : : I' I I | : | : | | 

Db 806 LAKDSKHQTAEAKKPAELSEQEKASQPKVSCPLCKTGLNIGSKDPPNFNTCTECKKVVCN 865 

Qy 664 VQETEAPYISIAC DLIK 680 

III :: : I I : I 

Db 866 LCGFNPMPHIVEVQE WLCLNCQTQRAMSGQLGDMGKVPLPKLGPSQPVSKPPATPQ 921

Qy 681 ETKLSTEPSPDFSNYSEIAKFEKSVP — EHAEL VEDSS 716 

I I I I : I : I | | | : : I :: 

Db 922 KQPVPAVSHSPQKSSTPPTPAATKPKEEPSVPKEVPKLQQGKLEKTLSADKIQQGIQKED 981 

Qy 717 PESEPVDLF SDDS I PEVPQTQEEAVMLMKES LTEVS ETVAQH-KEERLSAS PQELGK 772 

: I : II Mill::: | : | | : : : : | : : | | 

Db 982 AKSKQGKLFKTPSADKIQRVSQKEDSRLQQTKLTKTPSSDKILHGVQKEDIKFQEAKLAK 1041 

Qy 773 P YLES FQPNLHSTKDA ASNDI PTLTKKEKI SLQMEEFNTAI YSN 816 

I : II II : : : I : I I : I I : | : : : 

Db 1042 IPSADKILHRLQKEDPKLQQMKMAKALSADKIQPEAQKEDVQLQEVRLSKAVSADKIQHG 1101 

Qy 817 —DDL LSSKEDKI KESETFSDSSPIEII-DEFPTFVSA 851 

II : I : I I III: I I : I : | | | : 

Db 1102 IQKDLNLQHVKIEKTSSVEKIQEAQKESKLQQDKLPKTLSEDKIPATVSSDHKKLLSKSE 1161 

Qy 852 KDDSPKLAKEYTDLEVSDKSEIANIQSG — ADSLPCLELPCDLSFKNIYPKDEVHVSDE- 908 

: I M :M Ml : I : : II I I : : I : 1 I 

Db 1162 EDKKPELLEKSTPHPKDKKEQITAETTGHITEQKVEVEAPCD KLHEKKQEDVKKED 1217 

Qy 909 FSENRSSVSKA — SISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 962 

: I M : I MM::: I : | M I : | | | | : 
Db 1218 LTTGIPQMVSKPEKAEEEKTPVPVSRL-PRSDHVEAVREK-IEKEDDK SDTSSSQQQ 1272 

Qy 963 LS 964 

I 

Db 1273 KS 1274 

RESULT 12 
CPN_DROME 

ID CPN_DROME STANDARD; PRT; 865 AA.

AC Q02910; 

DT 01-OCT-1993 (Rel. 27, Created) 



DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Calphotin. 

GN CPN OR CAP. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S;

RX MEDLINE=93165729; PubMed=8094559; 

RA Martin J.H., Benzer S., Rudnicka M., Miller C.A.; 

RT "Calphotin: a Drosophila photoreceptor cell calcium-binding protein."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1531-1535(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165730; PubMed=8434015; 

RA Ballinger D.G., Xue N., Harshman K.D.; 

RT "A Drosophila photoreceptor cell-specific protein, calphotin, binds 

RT calcium and contains a leucine zipper."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1536-1540(1993). 

CC -!- FUNCTION: Might function as a calcium-sequestering "sponge" to 
CC regulate the amount of free cytoplasmic calcium. It binds 0.3 mole 

CC of Ca(2+) per mole of protein. 

CC -!- SUBUNIT: Homodimer (Probable). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic; hypodense compartment.
CC -!- TISSUE SPECIFICITY: Soma and axons of photoreceptor cells of
CC compound eyes and ocelli.
CC -!- DEVELOPMENTAL STAGE: Expressed early in photoreceptor cell
CC development.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; L02111; AAA28405.1; -.
DR EMBL; L05080; AAA28420.1; -.
DR PIR; A47282; A47282.
DR PIR; A47283; A47283.
DR FlyBase; FBgn0010218; Cpn.
DR GO; GO:0005509; F:calcium ion binding; IDA.
KW Calcium-binding.

FT CONFLICT 36 36 A -> AVAPAWA (IN REF. 2). 

FT CONFLICT 43 43 I -> T (IN REF. 2). 

FT CONFLICT 64 64 I -> V (IN REF. 2). 

FT CONFLICT 76 76 T -> A (IN REF. 2). 

FT CONFLICT 100 100 P -> PP (IN REF. 2). 

FT CONFLICT 126 127 VQ -> AP (IN REF. 2). 

FT CONFLICT 154 154 I -> V (IN REF. 2).

FT CONFLICT 160 160 S -> T (IN REF. 2). 



FT CONFLICT 534 534 A -> E (IN REF. 2). 

FT CONFLICT 699 699 I -> T (IN REF. 2). 

FT CONFLICT 703 703 V -> L (IN REF. 2). 

FT CONFLICT 721 721 D -> E (IN REF. 2). 

SQ SEQUENCE 865 AA; 84781 MW; 2110417E0B0E7CFE CRC64;



Query Match 5.1%; Score 295.5; DB 1; Length 865; 

Best Local Similarity 21.3%; Pred. No. 0.00024; 

Matches 217; Conservative 127; Mismatches 379; Indels 295; Gaps 41 

Qy 62 AAGLSAAAVP PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPER QPSWERSPAAPAP 117 

: I : : I Mill: I : I I I I I : Ml I I : Mill 

Db 11 SAPVAAPVTPSAV7^PVQWSPAAVAPAPAAPIAVTPVAPPPTLASVQPATVTIP--APAP 69 

Qy 118 SLPPAAAVLPSKLPEDDEPPARPPPPPPAGA SPLAEPAAPPSTP 161 

I M I : II : Ml Ml : I : I I I | I 

Db 70 IAAASVTPVASVAPPWAAPTPPAASPVSTPVAVAQIPVAVSAPVAPPVAATPTPWQIP 129 

Qy 162 -AAPKRRGS GS VDETLFALP--AASEP VIPSSAEKIMDLMEQPGNT VSSGQED 211

Ml M I I I I I || : | IN | : : 

Db 130 VAAP VI AT P P VAAS APT PAAVT PVI S PVIAS P P WPANTTVPVAAPVAA 178 

Qy 212 FPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATN 270 

1= : II l:::l I :: : : || | : |: || 

Db 179 VPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA — TIPECVAPLIPEVSWATK 236 

Qy 271 PFVNRDLAEFSELEYSEMGSSFKGSPKGESAI LVENTKEEVIVRSKDKEDLVCSAALHSP 330 

I M : I I M I I I 

Db 237 PLA AAE P VWAP PAT ET P WAP AAAS P 263 

Qy 331 QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLA 390 

I I : I : : : I I I I I 
Db 264 HVS VAP AVETAWAPV 279 

Qy 391 ARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCASF 446

I: I I I M :| : I Mill:: : : 

Db 280 - S AS T E P P VAAAT LTTAP ET P ALAP WAE S QVAANT WAT PPT PAP E P ET I AP P WAET P 338 

Qy 447 TSATESTTANTFPLLEDHTSENKTDEKKIEERKAQIITEKTSPKTSNPFLVAVQDSEADY 506 

I : : M : : : I : | I I :|| I : 
Db 339 EVASVAVAETTPPWPPVAAES 1 PAP WAT T P VP AT LAVT D P D 381 

Qy 507 VTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES 566 

I M I I : I : I I | | | | | | 
Db 382 VTASAVPELPPVIAPSPVPSA VAETPVDLA PPV 414 

Qy 567 LYP-TAQLCPSF— EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAP 619 

M I : I : M I I : I : M I : : I : I I : I I || 

Db 415 L P PVAAE PVP AWAEET P ET P AP AS AP VT I - AALD — I P EVAP VI AAP S DAP AEAP S AAA 471 

Qy 620 PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD — 677 

I II II I: :: : 

Db 472 PIVS TPPT TASVPETTAPPAAVPTEPI 498

Qy 678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE — LVEDSSPESEPVDLFSD-DSIP 731 

: M M : I : I : I : M I I : : I I : I I I : I 

Db 499 DVSVLSEAAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDLLEQTTSVP 558 



Qy 732
 : I
Db 559

Qy 792
 I III I :
Db 615

Qy 852 KDDSPKLAKEYTDLEV-SDKSEIANIQSGAD SLPCLEL PCDLSFKNIYP- 899
 I I : | : | : : : : | | | | | : | : | : | : : | |
Db 652

Qy 900 IVJ3 DEFSENRS S VS KAS I S P SNVSALEPQTEMG 936
 I : I I : : : : I : II:
Db 708

Qy 937
 HI: I hi: : I I I I : : I |
Db 767



RESULT 13 
MAPB_RAT

ID MAPB_RAT STANDARD; PRT; 2459 AA.
AC P15205; Q62958; Q9ER21; Q9QW92;
DT 01-APR-1990 (Rel. 14, Created)
DT 16-OCT-2001 (Rel. 40, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Microtubule-associated protein 1B (MAP 1B) (Neuraxin) [Contains: MAP1
DE light chain LC1].
GN MAP1B.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE OF 1-142 FROM N.A.
RC STRAIN=Sprague-Dawley; TISSUE=Testis;
RX MEDLINE=96257242; PubMed=8666295;
RA Liu D., Fischer I.;
RT "Isolation and sequencing of the 5' end of the rat microtubule-
RT associated protein (MAP1B)-encoding cDNA.";
RL Gene 172:307-308(1996).
RN [2]
RP SEQUENCE OF 96-2459 FROM N.A., DOMAIN, AND INDUCTION.
RC STRAIN=Sprague-Dawley; TISSUE=Brain, and Glial tumor;
RX MEDLINE=92347374; PubMed=1639092;
RA Zauner W., Kratz J., Staunton J., Feick P., Wiche G.;
RT "Identification of two distinct microtubule binding domains on
RT recombinant rat MAP 1B.";
RL Eur. J. Cell Biol. 57:66-74(1992).
RN [3]
RP SEQUENCE OF 1541-2459 FROM N.A., AND TISSUE SPECIFICITY.
RC TISSUE=Spinal cord;
RX MEDLINE=90059871; PubMed=2555150;
RA Rienitz A., Grenningloh G., Hermans-Borgmeyer I., Kirsch J.,
RA Littauer U.Z., Prior P., Gundelfinger E.D., Schmitt B., Betz H.;
RT "Neuraxin, a novel putative structural protein of the rat central
RT nervous system that is immunologically related to microtubule-
RT associated protein 5.";
RL EMBO J. 8:2879-2888(1989).
RN [4]
RP DEVELOPMENTAL STAGE, AND PHOSPHORYLATION.
RX MEDLINE=97405699; PubMed=9260743;
RA Ma D., Nothias F., Boyne L.J., Fischer I.;
RT "Differential regulation of microtubule-associated protein 1B (MAP1B)
RT in rat CNS and PNS during development.";
RL J. Neurosci. Res. 49:319-332(1997).

CC -!- FUNCTION: The function of brain MAPS is essentially unknown.
CC Phosphorylated MAP1B may play a role in the cytoskeletal changes
CC that accompany neurite extension. Possibly MAP1B Binds to at least
CC two tubulin subunits in the polymer, and this bridging of subunits
CC might be involved in nucleating microtubule polymerization and in
CC stabilizing microtubules.
CC -!- SUBUNIT: 3 different light chains, LC1, LC2 and LC3, can associate
CC with MAP1A and MAP1B proteins.
CC -!- TISSUE SPECIFICITY: Nervous system (spinal cord, brain stem,
CC cerebellum and cerebrum). Not expressed in liver, spleen, kidney,
CC heart or muscle.
CC -!- DEVELOPMENTAL STAGE: In cerebral cortex, spinal cord and sciatic
CC nerve levels are high early in development but decrease during
CC postnatal development and are low in adults. In dorsal root
CC ganglia levels remain high throughout development.
CC -!- INDUCTION: By nerve growth factor.
CC -!- DOMAIN: Has a highly basic region with many copies of the sequence
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is
CC responsible for the binding of MAP1B to microtubules.
CC -!- PTM: LC1 is coexpressed with MAP1B. It is a polypeptide generated
CC from MAP1B by proteolytic processing. It is free to associate with
CC both MAP1A and MAP1B. It interacts with the amino-terminal region
CC of MAP1B (By similarity).
CC -!- PTM: Phosphorylated.
CC -!- SIMILARITY: TO MAP1A.
CC -!- CAUTION: A C-terminal fragment of this protein (residues 1597 to
CC 2459) was originally described as neuraxin in Ref.3.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U52950; AAB17068.1; 

DR EMBL; X60370; CAC16162.1; -. 

DR EMBL; X16623; CAA34620.1; ALT_SEQ. 

DR PIR; A56577; A56577. 

DR InterPro; IPR000102; MAP1B_neuraxin.
DR Pfam; PF00414; MAP1B_neuraxin; 10.
DR PROSITE; PS00230; MAP1B_NEURAXIN; 8.
KW Microtubule; Repeat; Phosphorylation.
FT CHAIN ? 2459 MAP1 LIGHT CHAIN LC1.
FT REPEAT 1869 1885 MAP1B 1.
FT REPEAT 1886 1902 MAP1B 2.
FT REPEAT 1903 1919 MAP1B 3.
FT REPEAT 1920 1936 MAP1B 4.
FT REPEAT 1937 1953 MAP1B 5.
FT REPEAT 1954 1970 MAP1B 6.
FT REPEAT 1988 2004 MAP1B 7.
FT REPEAT 2005 2021 MAP1B 8.
FT REPEAT 2022 2038 MAP1B 9.
FT REPEAT 2039 2055 MAP1B 10.
FT DOMAIN 559 1035 GLU-RICH.
FT DOMAIN 588 786 LYS-RICH (HIGHLY BASIC, CONTAINS MANY
FT KKEE AND KKEI/V REPEATS).
FT DOMAIN 2224 2312 LYS-RICH.
FT CONFLICT 127 127 M -> V (IN REF. 1).
FT CONFLICT 140 140 T -> S (IN REF. 1).
FT CONFLICT 2112 2112 R -> K (IN REF. 3).
FT CONFLICT 2169 2169 L -> I (IN REF. 3).
SQ SEQUENCE 2459 AA; 269497 MW; 2E3F6872DEDB8BA2 CRC64;

Query Match 5.0%; Score 291.5; DB 1; Length 2459;

Best Local Similarity 20.0%; Pred. No. 0.0013;

Matches 220; Conservative 167; Mismatches 456; Indels 255; Gaps 44;

Qy 30 TEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPA 89

:i I I!:!" M I : I : : : | | II I | | : : | : 
Db 1008 SEEEGEEEEDKAEDAREEDHEPDKTE AEDYVMAWDKAAEAGVTEDQYDFL 1058 

Qy 90 PRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGAS 149 

I : I I HI II::: | | | | || 
Db 1059 GTPAKQ PGVQSPSREPASSIHDETLPGGSESEAT AS 1094 

QY 150 PLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGN TVS 206 

I I M I : : | | I I : I III 

Db 1095 DEENREDQPEEFTAT SGYTQST IEISSEPTPMDEMSTPRDVMTDETNNEETES 1147 

Qy 207 SGQE DFPSVLLETAASLP SLSPLS TVS FKEHG YLGNL SAVS S S EGT I E 254 

II : I I II I : II I : I : | : | ( 

Db 1148 PSQEFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTISPPSSMEE 1207 

Qy 255 ETLNEAS KELPERATNPFVNRDLAEF--SELEYSEMG 289 

: : : : : I : : : : I : I : I : I : : I 

Db 1208 DKFSKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERS 1267 

QY 290 SSFKGSPKGESAI LVENTKEEVIVR SKDKEDL VCSAALHSP 330 

: I I :M: :| : :| I ::: I :| I: 

Db 1268 VNFSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTP 1327 

Qy 331 -QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFK PFEQAWEVKD 380 

•II : I I : : II |: : I | :: | | 

Db 1328 YYQSPTDEKSSHLPTEVT ENAQAVPVS FEFTEAKDENERS SIS PMDE- - PVPD 1378 

Qy 381 TYEGSRDVLAARANVESKVDRKCLED— SLEQKSLGKDS EGRNEDASFPSTPEPVK 434 

: II: : II I : 1:11: I ||:| I || 

Db 1379 SESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSDKESPVS 1438 



Qy 435 DSSRAYITCAS FTSATESTTANTFPLLEDHTSENKT DEKKI 475



I si : I I | : | I : | | | | : | : 

Db 1439 D LTSDLYQDKQEEKRA.GFIPIKEDFSPEKKASDAEIMSSQS7VLALDERKLGGDGS 14 93 

Qy 476 EERKAQIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAV-- 520 

I: I I I I:: I :|: I : :: |: |:| 
Db 1494 PTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT Y S HME GVAS VS T AS VAT 1550

Qy 521 SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES-LYPTAQLCP S 576

I : M I I I : : I : | : : | : | | | | | : : | ; : | | | 

Db 1551 SSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMS 1609 

Qy 577 FEEAEATP SPVLPDIVMEAPLNSLL P SAGAS WQ 610

: : i I : : I : I : I I I : I I : : 

Db 1610 ISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHPTVGAGMLH 1669 

Qy 611 PSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFN 661 

: : IM I | : | ||:::::: : | : : 

Db 1670 ITEN— GPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLS-ELISVSQVEASPSTSS 1725 

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721 

I II I ::| I I I I: :| : | :| || : | : 
Db 1726 AHTPS QIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKL SPKSDI 1775 

Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQP 780 

I : I I :: III I :| : : :| | | :: | 

Db 1776 SPLTPRESSPTYSPGFSDSTSGAKES-TAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQH 1834 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

: I : : I : : : : I : : | | I : I : : : : | | | : 

Db 1835 HLALSRDLTTSSV EKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYE — SHEKTIQ 1888 

Qy 841 I IDEFPTFVSAKDDSPK LAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896

I : : : I I : I : I : : I | | : - : | 

Db 1889 AHDVGGYYYEKTERTIKSPCDSGYSYETIEKTTKTP EDGGYS CEITEKT 1937 

Qy 897 IYPKDEVHVSDEFSENRSSVSKAS ISPSNVSALEPQTEMGSIVKSKSL 944 

: I I I I I : : I : : : : | : | 

Db 1938 TRTPEEGGYSYEISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSY 1997 

Qy 945 TKEAEKKLPSDTEKEDRS 962 

: I : I : I II I 
Db 1998 SYETTEKITSFPESESYS 2015 



RESULT 14 
MAPB_HUMAN

ID MAPB_HUMAN STANDARD; PRT; 2468 AA.
AC P46821;
DT 01-NOV-1995 (Rel. 32, Created)
DT 01-NOV-1995 (Rel. 32, Last sequence update)
DT 16-OCT-2001 (Rel. 40, Last annotation update)
DE Microtubule-associated protein 1B (MAP 1B) [Contains: MAP1 light chain
DE LC1].
GN MAP1B.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.



OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Fetal brain;
RX MEDLINE=95104835; PubMed=7806212;
RA Lien L.L., Feener C., Fischbach N., Kunkel L.M.;
RT "Cloning of human microtubule-associated protein 1B and the
RT identification of a related gene on chromosome 15.";
RL Genomics 22:273-280(1994).
CC -!- FUNCTION: The function of brain MAPS is essentially unknown.
CC Phosphorylated MAP1B may play a role in the cytoskeletal changes
CC that accompany neurite extension. Possibly MAP1B Binds to at least
CC two tubulin subunits in the polymer, and this bridging of subunits
CC might be involved in nucleating microtubule polymerization and in
CC stabilizing microtubules.
CC -!- SUBUNIT: 3 different light chains, LC1, LC2 and LC3, can associate
CC with MAP1A and MAP1B proteins.
CC -!- DOMAIN: Has a highly basic region with many copies of the sequence
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is
CC responsible for the binding of MAP1B to microtubules.
CC -!- PTM: LC1 is coexpressed with MAP1B. It is a polypeptide generated
CC from MAP1B by proteolytic processing. It is free to associate with
CC both MAP1A and MAP1B. It interacts with the amino-terminal region
CC of MAP1B (By similarity).
CC -!- SIMILARITY: TO MAP1A.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; L06237; AAA18904.1; -.
DR Genew; HGNC:6836; MAP1B.
DR MIM; 157129;
DR GO; GO:0005875; C:microtubule associated complex; TAS.
DR InterPro; IPR000102; MAP1B_neuraxin.
DR Pfam; PF00414; MAP1B_neuraxin; 10.
DR PROSITE; PS00230; MAP1B_NEURAXIN; 6.
KW Microtubule; Repeat; Phosphorylation.
FT CHAIN ? 2468 MAP1 LIGHT CHAIN LC1.
FT REPEAT 1878 1894 MAP1B 1.
FT REPEAT 1895 1911 MAP1B 2.
FT REPEAT 1912 1928 MAP1B 3.
FT REPEAT 1929 1945 MAP1B 4.
FT REPEAT 1946 1962 MAP1B 5.
FT REPEAT 1963 1979 MAP1B 6.
FT REPEAT 1997 2013 MAP1B 7.
FT REPEAT 2014 2030 MAP1B 8.
FT REPEAT 2031 2047 MAP1B 9.
FT REPEAT 2048 2064 MAP1B 10.
FT DOMAIN 589 790 LYS-RICH (HIGHLY BASI
FT KKEE AND KKEI/V REPEA
SQ SEQUENCE 2468 AA; 270618 MW; 540839CBDF09D461 CRC64;



Query Match 4.9%; Score 288; DB 1; Length 2468; 

Best Local Similarity 20.2%; Pred. No. 0.0017; 

Matches 251; Conservative 166; Mismatches 441; Indels 386; Gaps 56; 

Qy 13 STDSPPR--PPPAFKYQFVTEPEDEEDEEE EED EEEDDED 50

HI I: | : :||| ::: |;| :|| , 

Db 625 ATDVKPKAAKEKTVKKETKVKPEDKKEEKEKPKKEVAKKEDKT PI KKEEKPKKEEVKKEV 684 

Qy 51 LEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWER 110
=l= : 1=1 I : :: I : : ::: |: 
Db 685 KKEIKKEEKK EPKKEVKKETPPKEVKKEVKKEEKKEVKKE EK 72 6 

Qy 111 SPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRG-- 168

' HI : I | : | | | | |:| | | : | 

Db 727 EPKKEIKKLPKDAKKSSTPLSEAKKPAALKPKVPKKEESVKKDSVA AGKPKEKGKI 782 

Qy 169 SGSVDETLFAL PA ASEPVI PS SAEKIMDLME 199

' I : I II I :: I : | | 

Db 7 83 KVIKKEGKAAEAVT^AVGTGATTAAVMAAAGIAAIGPAKELEAERSLMSSPEDLTKDFEE 842 

Qy 200 QPGNTVSSGQEDFPSV-LLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLN 258 

I :: I : | : | | | : :: : I I | | | | 

Db 843 LKAEEVDVTKDIKPQLELIEDEEKLKETEPVEAYVIQKEREVTKGPAESPDEG-ITTTEG 901 

Qy 259 EASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTK-EEVIVRSKD 317 

1 I I : : : : I : I : I : | | : I : | | : | 

Db 902 EGECEQTPEELEPVEKQGVDDIEKFE — DEGAGFEES — SETGDYEEKAETEEAEEPEED 957 

Qy 318 KEDLVC-SAALHSP QESPVGKEDRWSPEKTMDIFNEMQMSV 358 

l ; II M: Ml :|| :|| | : || , 
Db 958 GEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRA- — EEDMD EAIEKG 1011 

Qy 359 VAPVREEYADFKPFEQAWEVK-DTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410 

I I I I I : : : I : : : I I : : | I : | | : | : 

Db 1012 EAEQSEEEADEE DKAEDAREEEYEPEK — MEAEDYVMAWDKAAEAGGAEEQYGFLTT 1067 

Qy 411 — KSLGKDSEGR NEDASFPSTPEPVKDSSRAYITCAS FTSATESTT 454 

III IN :| : I I :| : : :| :| : 

Db 1068 PTKQLGAQSPGREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEIS 1127 

Qy 455 ANTFPL LEDHTSENKTDEK KIEERKAQIITEKTS-PKTSNPFLVAV 499

: I: : | |: :|: I | | | 

Db 1128 SEPTPMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPADVTPLNGFS 1187

Qy 500 QDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDL 556

: ' : I N |: : | : | : | | :| ::: | : | : 
Db 1188 EGSKTD ATDGKDYNASASTISPPSSMEED KFSRSALRDAYCSEVKASTTLDIKDS 1242

Qy 557 — VQTSEAIQESLYPTAQLCP SFE EAEATPSPVLPDIV 592 

: I I : I I : I : | I I I : I I I : : 

Db 1243 ISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFSLTPNEIKVSAEAEVAPVSPEVT 1302 

Qy 593 ME APLNSLLPSAG ASWQ 610 

I :| I: III |:: 

Db 1303 QEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSPTDEKSSHLPTEVIEKPPAVPVS 1362 

Qy 611 PSVSPLEAPPPVSYDSIK-- LEPENPPP YEEAMNVALKALG 649



I N I : : I I I I : | | | | | | : : | | | 

Db 1363 FEFSDAKDENERASVSPMDEPVPDSESPIEKVLSPLRSPPLIGSESAYESFLSADDKASG 1422 

Qy 650 TKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPD-FSNYSEIA KF 701

Db 1423 RGAESPF EEKSGKQGSPDQVSPVSEMTSTSLYQDKQ 1458 

Qy 702 EKSVPEHAELVEDSSPE SEPVDLFSDDSI PEVPQTQ EEAV 741 

1 = 1 = 111 1:1 : : :| || | : 

Db 1459 EGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPTQIDVSQFGSFKEDTK 1518 

QY 742 MLMKE SLTEVSETVAQ HKEERLSASPQELG KPYLESFQPNLHSTK 786 

I : I I I I I I I : || || : : | : , : | , 
Db 1519 MS I S EGT VS DKS AT P VDEGVAEDT YS HMEGVAS VSTAS VAT S S FPEPTTDDVS P S LH 1575 



Qy 787 DAASNDIPTLTK-KEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEF 845 
I I I: : :|: : : | : :: |||: : 
Db 1576 --AEVGSPHSTEVDDSLSVSWQTPTT-FQETEMSPSKEECPR 1615



Qy 846 PTFVSAKDDSPKLAKEYTDLE-- VSDKSEIANIQSGADSLPCLELPCDLSFKN 896

I : I I 1 I I I I I : : I : : I : : I : I : | | | | | :: 
Db 1616 PMSISPPDFSPKTAKSRTPVQDHRSEQSSM-SIEFGQES-PEQSLAMDFSRQSPDHPTVG 1673

Qy 897 IYPKDE-VHVSDEFSENRSSVSKASISPSNV 926 

I I :| : I Ml: I I I 

Db 1674 AGVLHITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTS 1733 



Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE 970
II | : : : | : : : | : | : I I I I I : : | 

Db 1734 SAHTP-SQIASPLQEDTLSDVAPPR DMSLYASLTSE 1768



RESULT 15 
MAPB_MOUSE 

ID MAPB_MOUSE STANDARD; PRT; 2464 AA.

AC P14873; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein 1B (MAP 1B) (MAP1.2) (MAP1(X))
DE [Contains: MAP1 light chain LC1].
GN MAP1B OR MTAP1B OR MTAP5.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND DOMAIN. 

RC STRAIN=Swiss Webster; TISSUE=Brain; 

RX MEDLINE=90094539; PubMed=2480963;
RA Noble M., Lewis S.A., Cowan N.J.;
RT "The microtubule binding domain of microtubule-associated protein
RT MAP1B contains a repeated sequence motif unrelated to that of MAP2
RT and tau.";

RL J. Cell Biol. 109:3367-3376(1989). 

CC -!- FUNCTION: The function of brain MAPs is essentially unknown. 

CC Phosphorylated MAP1B may play a role in the cytoskeletal changes 



CC that accompany neurite extension. Possibly MAP1B binds to at least 
CC two tubulin subunits in the polymer, and this bridging of subunits 
CC might be involved in nucleating microtubule polymerization and in 
CC stabilizing microtubules. 
CC -!- SUBUNIT: 3 different light chains, LC1, LC2 and LC3, can associate 
CC with MAP1A and MAP1B proteins. 
CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 
CC responsible for the binding of MAP1B to microtubules. 
CC -!- PTM: LC1 is coexpressed with MAP1B. It is a polypeptide generated 
CC from MAP1B by proteolytic processing. It is free to associate with 
CC both MAP1A and MAP1B. It interacts with the amino-terminal region 
CC of MAP1B. 
CC -!- SIMILARITY: TO MAP1A. 
CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 
CC the European Bioinformatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib.ch). 

DR EMBL; X51396; CAA35761.1; -. 
DR PIR; S07549; QRMSP1. 
DR MGD; MGI:1306778; Mtap1b. 
DR GO; GO:0016358; P:dendrite morphogenesis; IMP. 
DR GO; GO:0001578; P:microtubule bundling; IMP. 
DR InterPro; IPR000102; MAP1B_neuraxin. 
DR Pfam; PF00414; MAP1B_neuraxin; 10. 
DR PROSITE; PS00230; MAP1B_NEURAXIN; 7. 

KW Microtubule; Repeat; Phosphorylation. 



FT CHAIN          2464 MAP1 LIGHT CHAIN LC1. 
FT REPEAT    1874 1890 MAP1B 1. 
FT REPEAT    1891 1907 MAP1B 2. 
FT REPEAT    1908 1924 MAP1B 3. 
FT REPEAT    1925 1941 MAP1B 4. 
FT REPEAT    1942 1958 MAP1B 5. 
FT REPEAT    1959 1975 MAP1B 6. 
FT REPEAT    1993 2009 MAP1B 7. 
FT REPEAT    2010 2026 MAP1B 8. 
FT REPEAT    2027 2043 MAP1B 9. 
FT REPEAT    2044 2060 MAP1B 10. 
FT DOMAIN     589  787 LYS-RICH (HIGHLY BASIC, CONTAINS MANY 
FT                     KKEE AND KKEI/V REPEATS). 

SQ SEQUENCE 2464 AA; 270408 MW; FBD3DD99CFDBDA87 CRC64; 



Query Match 4.9%; Score 284; DB 1; Length 2464; 

Best Local Similarity 20.7%; Pred. No. 0.0025; 

Matches 233; Conservative 159; Mismatches 424; Indels 312; Gaps 51; 

Qy 31 EPEDEEDEEEEEDEEED DEDLEELE VLERKPAAG LSAAAVP 71 

I I I: : I I : I I I I :: || 1:1 

Db 1009 EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAEAGVTEEQYGYLGTSAKQ 1068 

Qy 72 PAAAAPLLDFSS DSVPPAPRGPLPAAPPAAPERQP- -SWERSPAAPA 116 

I :l : :| :::| |: | || I : I 

Db 1069 PGIQSPSREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEISSEPT 1128 



Qy 117 P SLPPAAAVLPSKLPEDDEPPARP PPPPPAGASPLAEPA— 155 

I I : I: I |:: I I : | : | : 

Db 1129 PMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPAVASFNGLSEGSKT 1188 

Qy 156 AP P S T PAAP KRRG S GS VD ETL FAL PA AS E PVI P S S AEKIMDLMEQ 200 

llhl |::| |: | : I : :||| :|: : 

Db 1189 DATDGKDYNASASTISPP SSMEEDKFSKSALRDAYCSEEKELKASAE — LDIKDV 1241 

Qy 201 PGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEA 260 

: I Mill 111:1 
Db 1242 SDERLS PAKSPSLSP SPPSPIEKT 1265 

Qy 261 SKELPERATNPFVNRDLAEFS ELEYSEMGS S FKGS PKGESAI LVEN- -TKEEVI VR 314 

I II: I II |:: | | : || |:: |: : || : 

Db 1266 PLGERSVN FSLTPNEIKVSAEGEARSVSPGVTQAWEEHCASPEEKTLE 1314 

Qy 315 SKDKEDLVCSAALHS P-QES PVGKEDRWS PEKTMDI FNEMQMS WA- PVREE YADFK- - 370 

I ; l Isl Ml : II: : I: : | || |::: | 

Db 1315 WSPSQSVTGSAGHTPYYQSP TDEKSSHLPTEVSENAQAVPVSFEFSEAKDE 1366 

Qy 371 PFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED--SLEQKSLGKDS 417 

I :: I I : M: : II I : I II: I 

Db 1367 NERASLSPMDE--PVPDSESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGRRSESPF 1424 

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKT 470 

H = l II 111= : : I : MM III 

Db 1425 EGKNGKQGFPDRESPVSDLT STGLYQDKQEEKSTGFIPIKEDFGPEKKTSDVETMS 1480 

Qy 471 DEKKI EERKAQI ITEKTSPKTSNPFLVAVQDSEAD 505 

MM: I: I I - I |:: | | :| 

Db 1481 SQSALALDERKLGGDVSPTQIDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGV--AEDT 1538 

Qy 506 YVTTDTLSKVTEAAV--SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAI 563 

I : : : I : I : I Mil | | | : : | : | : : | . | | | 

Db 1539 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 1597 

Qy 564 QES-LYPTAQLCP SFEEAEATP SPVLPDIVMEAPLNSLL — 601 

I I: : M : II I : :| | : : | : | : | 

Db 1598 QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 1657 

Qy 602 PSAGASWQPSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVA 644 

M II I I : : : I I I I I : I | | : : : : 

Db 1658 FSRQSPDHPTLGASVLHITEN---GPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLS 1714 

Qy 645 LKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKS 704 

: : : I : M I I::' I : I |: :| 
Db 1715 -ELISVSQVEASPSTSSAHTPS QIASPLQEDTLSDWPPREMSLYASLA 1762 

Qy 705 VPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVML MKES LT EVS ETVAQHKE 760 

: I I : III: III I I I : | : | : | | | | 

Db 1763 SEKVQ— SLEGEKLSPKSDIS PLTPRESSPLYSPGFSDSTSAAKETAAAH — 1810 

Qy 761 ERLSASP— QELGKPY LESFQPNLHSTKDAASNDIPTLTKKEKI SLQMEEFN 810 

: I : I I Ml : : I :| : I : : : : | : : | | 

Db 1811 -QASSSPPIDAATAEPYGFRSSMLFDTMQHHLALNRDLTTSSV EKDSGGKTPGDFN 1865 



Qy 811 TAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDSPK LAKEYTDLE 866 

I :: I ::: : | | | | : : : | | : | 

Db 1866 YAYQKPENAAGSPDEEDYDYE--SQEKTIRTHDWRYYYEKTERTIKSPCDSGYSYETIE 1923 

Qy 867 VSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKAS------ 920 

: ' : = I I::: I :| I I II : : I 

Db 1924 KTTKTP EDGG YTCEITEKTTRTPEEGGYSYEISEKTTRTPEVSGYTYEK 1972 

Qy 921 ISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 962 

: : = : I : I : I : I : I I I I 

Db 1973 TERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESESYS 2020 



Search completed: September 29, 2004, 18:14:37 
Job time : 27.6724 secs 
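
Note on the RESULT 15 summary figures: the reported Best Local Similarity of 20.7% appears consistent with dividing the number of identical positions (Matches) by the total number of aligned columns, taken here as Matches + Conservative + Mismatches + Indels. That denominator is an assumption inferred from the numbers in this report, not a documented definition, and the function name below is illustrative only. A minimal sketch of the arithmetic:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all alignment columns.

    Assumption: the column count is the sum of the four reported
    categories; the search program's own definition is not stated
    in the report.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures reported for RESULT 15 (MAPB_MOUSE)
pct = best_local_similarity(matches=233, conservative=159,
                            mismatches=424, indels=312)
print(f"{pct:.1f}%")  # 233 / 1128 columns
```

With the reported counts the column total is 1128, and the result rounds to the 20.7% shown in the summary.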



